TTiis Page b Inserted by iPW Opei^^^ 
and is not a part qf the Official Record 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of 
the original documents submitted by the applicant. 

Defects in the images may include Q)ut are. not hmited to): 

. BLACKBORDEBS 

, TEXT GOT OFF AT TOP, BOT^ 

. FADED TEXT . 

. ILLEGIBLE TEXT 

. SKEWED/SLANTED IMAGES 

. COLORED PHOTOS 

. BLACK OR VERY BLACK AM) WHTTE DARK PHOTOS 

. GRAY SCALE DOCUMENTS 

IMAGES ARE BEST AVAILABLE COPY. 



As resGaniamg documents jm^/// w^?^ correct images, 
please do not report the images to the 
Image Problem Mailbox, 



THIS PAGE BLANK (U8FT0) 



PCT 



WORLD ; 



rELLECTUAL PROPERri' ORGANIZATION 
InLcmadcnil Bureau 



INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(SI) Internationa! Patent CbssiQcailon 6 ; 






(11) International Publication Number: 


wo 97/35966 


C12N 15/00, C12Q 1768 




Al 










(43) Intfirnational Publication Date: 


2 October 1997 {02.10.97) 



(21) International Application Number: PCT/l'S97/0-l7I5 

(22) International Filing Date: 20 N!arth 1097 (20.03.97) 



(30) Priority Data: 
OS/621.430 
03/62 1. S59 
03/650.400 



25 March 1996 (25.03.96) L'S 
25 March 1996 (25,03.96) L'S 
20 May 1996 (20.05.96) US 



(60) Parent Applicationi" or Grants 
(63) Rc!a:cd by Continuaiion 
US 

Filed or. 
US 

Fil;J CD 
US 

Fil::J on 



GS/621.430 (CrPj 
25 NIarch 1996 (25.03.96) 

0S;621.359 (CIP) 
25 Marrh 1996 (25.03.96; 

03.'650.-CO (CIP) 
20 Mav 1996 ;:0.05.96) 



Fnncisco. CA 94117 fUS), STEMMER. WiMcm. P.. C. 
[.N'L'USI; ICS Kaihy Court. Los Cares. CA 95030 lUS). 

(74) Agents: FITTS. Rcncc. A. ci al.; Townscnd and To-^r.scnd 
and Crew L.L.P.. 8th floor. Two Embarcadcro Ccnirr. San 
Francisco. CA 941 11.3S54 (US). 



(SI) Designated Stales: AL. AM. AT. AU. AZ. BA. BB. BG. BR. 
BV. CA. CH. CN. CU. CZ, DE. DK. EE. ES, FI. GB. GE 
HU. IL. IS. P. KE. KG. KP. KR. K2. LC. LK. LR. LS. LT. 
LU. LV. .MD. MG. MK. NLV. Nrvv, MX. NO. N2. PL. PT. 
RO. RU. SD. SE. SC. SI, SK. TJ. TM. TR. TT. UA. UG. 
US. UZ. VN. YU. ARIPO patent (CH, KE. LS. .MVV SD. 
S2. UG). EurasixT patent (A.M. AZ. BY. KG. KZ. .MD. RL'. 
TJ. TM). European pa:tnt (AT. BE. CH. DE, DK. ES. FI. 
FR, GB. GR, IE. IT. LU. .^!C. NL. PT. SE). OAPI ^y.znt 
{Bf. U;. Cr. CG. C(. CM. GA. CN. ML. WR, SE. SS TD 
TG). 



(71) Applicant ffcr a!! dc::qn^:^J S:a:cs ezccpi US;: NlA.VVG^N. 
I.VC. (US-USl; 3410 Ccr.[.-il E^prrsswav. Si,n:j CLi.'j, C.\ 
95051 fUS). 



With imttrnationrJ iCJrsh report. 



(72) Invenlors; and 
(75) In vc n tor V.-\p plica nLs (for L'S 
[Oil US}: Apar.rr.cn: No. 



r.:y): MiNSiiULL 
. II 30 Sl:2Sz: S': 



I (54) Tide: .VHmIODS AND CO.M?05!TiONS rCK CGLLU:,.\P A.'O METABOLIC ENGiNLH.^. ING 
! (57) Ahstrjcl 



FOR THE PCRFOSES Of IS FORM AT IDS OSLY 



Cc-iz: u^*i ;o Idcr.iify Su:cs c^r.y \o ?CT cn ih: frcr.i 







r.s 




AM 




Fi 




AT 




FK 




AU 


A',^:r»l;» 


CA 


G»bcn 


Al 


Aurtt;>L,T 


GB 




BA 




CE 


CcctU 


BB 




Gil 




BE 




CN 


Cuinci 


BF 




GK 


Crrcrc 


BG 


Byt(i.-u 


IIU 




DJ 




IE 


Irtlird 


BR 




[L 




DY 




IS 


Iccli-vl 


CA 




IT 


l:iJy 


CF 


Ccncd Ahicm Republic 


JP 




CC 


Cciijo 


KE 


Kcny» 


CM 




KG 


Kyrfyiiitn 


CI 


Ct<c d'Uoin 


KI' 


Dcmocniic Pcopic 


CM 


Cfc-T.cfocn 




R:publ'< of Korci 


CN 




KR 


Rcpubt'c of Korci 


cv 


Cub» 


KZ 




cz 


C:«:>i Republic 


LC 


S^nt LucU 


DE 


CcrrriLiy 


LI 


Licc^acaitcin 


DK 




LK 


Sfi Ur.ki 


EE 




LR 


Libcrii 



c: pjmph!c[S publishing ir.t:mj:icnil applications under the PCT. 



L5 




SI 






LT 




SK 


S'.ov»ii) 




LL' 




SN 


S<ncjil 




LV 




SZ 






•MC 




TO 






MD 


Rtp^jblc of Moldovj 


TG 


T^3 




MC 




TJ 


1'ijiiuiin 




MK 




TM 


TufimcnijiLi 






R:;-jb!< of MiCedonn 


TR 


Tu/W)r 




ML 


V.i!i 


TT 


TnnVlAd iT.d 


TobJjo 


MS- 




UA 






MR 


Mi-jriir.ij 


IG 


LI J o J • 




MSV 


MiJj-i 


ts 




of Ancrvi 


MN 




L'Z 






NT. 




N'N 


Vict Nun 








VU 






>0 






Zimbib'- c 




NZ 


Nc - Zcj!i.-vJ 








PL 










n* 


PjJTjJll 








RO 


R OTi in 1 1 








RU 


Ruii'iin Ftdtniion 








SO 


Sjdin 








SE 










SC 











'''35966 PCT/US97/04715 



10 



15 



20 



METHODS AND COMPOS TTTnKrg 
FOR CET.T.TTT.A J^ AND METABOLir ENGINT^PPTMP, 

This application is a continuation-in-parc of U.S. 
Patent Application Serial No. 08/621,859, filed March 25, 
1996, Serial No. 08/621,430, filed March 25, 1996, Serial No. 
08/425,684 filed, April 18, 1995 and Serial No. 08/650,400, 
filed May 20, 1956, the specifications of which are herein' 
incorporated by reference in t.heir entirety for all purposes. 

Background of the Invention 
Metabolic engineering is the manipulation of 
intermediary metaboiis.-n thro-gh the use of both classical 
genetics and genetic engineering techr.iques. Cellular 
engineering is generally a ~.or-3 inclusive tern referring to 
the r^.odification of cellular properties. Cameron et al . 
(Aoolied Biochem. Biotech. 33:105-140 (1993)) provide a 
su-mary of ecnuivalent terrr.s to describe this type of 
engineering, including "~etaboiic engineering", which is nost 
often used in the context of industrial -icrobiology and 
bicprocess e.ncmeering, - ir. vi trc evolution" or "directed 
25 evolution", rr.ost often used m the context of environnentai ' 
-icrobiology, "r^olecular breeding", ::>,ost often used by 
Japanese researchers, "cellular engineering", which is used to 
describe rr.odif ications of bacteria, anir.al, and plant cells, 
"rational strain develcp~ent " , and "metabolic pathway 
30 evolution". I^, .^^3 application, the terms "metabolic 
engineering" and "cellular engineering" are used 
preferentially for clarity; the term "evolved" genes is used 
as discussed below. 

Metabolic engineering can be divided into two basic 
35 categories: modification o: genes endoge.nous to the host 

organism to alter metacoiite flux a.nd introduction of foreign 
genes into an organism. Such introduction can create new 
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metabolic pathways leading to modified cell properties 
including but not limited to synthesis of known compounds not 
normally made by the host cell, production of novel compounds 
(e.g. polymers, antibiotics, etc.) and the ability to utilize 
new nutrient sources. Specific applications of metabolic 
engineering can include the production of specialty and novel 
chemicals, including antibiotics, extension of the range of 
substrates used for growth and product formation, the 
production of new catabolic activities in an organism for 
toxic chemical degradation, and modification of cell 
properties such as resistance to salt and other environmental 
factors . 

Bailey ( Science 252:1668-1674 (1991)) describes the 
application of metabolic engineering to the recruitment of 
heterologous cenes for the improvement of a strain, with the 
caveat that such introduction can result in new compounds that 
mav subsequently undergo further reactions, or that expression 
of a heterologous protein can result in proteolysis, improper 
folding, improper mod i f ica t icn . or unsuitable intracellular 
location of the protein, or lack of access to required 
s-'bstraces. Bailev reccnr^.ends careful configuration of a 
desired cenetic chan^^e with minimal perrurbation of the host. 

Liao ( Curr. Ocin. Biotech. A:211-216 (1993)) 
reviews mathe"atical nodelling and analysis of metabolic 
pathways, pointing o^i :ha: in many cases the kinetic 
carameters of enzv^es are unavailable or inaccurate. 

Steohanoooulos et al . ( Trends . Siotechnol . 11:392- 
395 (1993)) describe attempcs to improve productivity of 
cellular systems or effect radical alteration of the flux 
t.hrough primary metabolic pathways as having difficulty in 

control architectures at key branch points have evolved 



mat 



to resist flux chances. They conclude thac identification and 
characterization of these metabolic noces is a prerequisite tc 
rational metabolic engineering. Similarly, S tephanopoulos 
( Curr. Qpin. Biotech. 5:195-200 •159;)) concludes that rather 
than modifying the "rate limiting step" in m^etabolic 
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engineering, it is necessary to systematically elucidate the 
control architecture of bioreaction networks. 

The present invention is generally directed to the 
evolution of new metabolic pathways and the enhancement of 
bioprocessing through a process herein termed recursive 
sequence recombination. Recursive sequence recombination 
entails performing iterative cycles of recombination and 
screening or selection to "evolve" individual genes, whole 
plasmids or viruses, multigene clusters, or even whole genomes 
(Stemmer, Bio/Technolocv 13:549-553 (1995)). Such techniques 
do not require the extensive analysis and computation required 
by conventional methods for metabolic engineering. Recursive 
sequence recombination allows the recombination of large 
numbers of mutations in a minimum number of selection cycles, 
in contrast to traditional, pairwise recombination events. 

Thus, because r.e:abolic and cellular engineering 
can pose the particular prc-le- of the interaction of many 
gene products and regulatcry T.echanisms, recursive sequence 
recombination (RSR) techni::.os provide particular advantages 
in that they provide recor± i na r ion between m.utations in any or - 
ail of these, thereby provi-ing a very fast way of exploring 
the manner in which cifferen:: combmacions of m.utations can 
affect a desired resul:, whether that result is increased 
yield o: a metabolite, altered catalytic activity or substrate 
specificity of an entv^e or an entire metabolic pathway, or 
altered response of a cell to its environment. 

Summary of the Invention 
One aspect of the invention is a m.ethod of evolving 
a biocatalytic activity of a cell, comprising: 

(a) recombining at least a first and second DN,:\ 
segment from at least one cer.e conferring ability to catalyze 
a reaction of interest, the :;-ent s di f f ering from each other 
m at least two nucleotide,-, tc produce a library of 
recomiDinant genes; 

(b) screening at least one recombinant gene from 
the library that confers enhanced ability to catalyze the ^ 
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.reaction of interest by the cell relative to a wildtype form 
of the gene; 

(c) recorrjoining at least a segment from at least 
one recombinant gene with a further DNA segment from at least 
one gene, the same or different from the first and second 
segments, to produce a further library of recombinant genes; 

(d) screening at least one further recombinant 
gene from the further library of recombinant genes that 
confers enhanced ability to catalyze the reaction of interest 
in the cell relative to a previous recombinant gene; 

(e) repeating (c) and (d) , as necessary, until the 
further recombinant gene confers a desired level of enhanced 
ability to catalyze the reaction of interest by the cell. 

Another aspecc of the invention is a method of 
evolving a gene to confer ability to. catalyze a reaction of 
interest, the method comprising: 

(1) recombining az lease first and second DNA 
segments from at least one gene conferring ability to catalyze 
a reaction of interest, the segrr.ents differing from each other 
in at least two nucleotides, to produce a library of 

r e comb i n a n t genes; 

(2) screening at least one recorr±)inant gene from 
the library that confers enhanced ability to catalyze a 
reaction of interest relative to a v/ildtype form of the gene; 

(3) recc.-bining at least a segment from the at 
least one recon-Joinant gene with a further DNA segment from the 
at least one gene, the saTie or different from the first and 
second segments, to produce a further library of recombinant 
genes; 

(4) screening at least one further recomJbinant 
gene from the further library of recombinant genes that 
confers enhanced ability to catalyze a reaction of interest 
relative to a previous recorribinan t gene; 

(5) repeating (3) and (4), as necessary, until the 
further recomjoinant gene confers a desired level of enhanced 
ability to catalyze a reaction of interest. 
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A further aipect of the invention is a method of 
generating a new biocatalytic activity in a cell, comprising: 

(1) recombining at least first and second DNA 
segments from at least one gene conferring ability to catalyze 

5 a first reaction related to a second reaction of interest, the 
segments differing from each other in at least two 
nucleotides, to produce a library of recombinant genes; 

(2) screening at least one recombinant gene from 
the library that confers a new ability to catalyze the second 

10 reaction of interest; 



20 



30 



35 



(3) recombining at least a segment from ac least 
one recombinant gene with a further DNA segment from the at 
least one gene, the same or different from the first and 
second segments, co produce a further library of recombinant 

15 genes; 

(4) screening ac least one further recombinant 
gene from the further library c: recombinant genes that 
confers enhanced ability to catalyze the second reaction of 
interest in the cell relative :o a previous recombinant gene; 

\Dj ^ ,3; a.io (4), as necessary, until the 

further recor^bmanc gene confers a desired level of enhanced ' 

seco-d reaction of interest in the 



ability to catalyze r!-- q.^.^^-^- 
c-11 . 



aspect of the invention is a "odified form 

e tabolic 



cell, wherein tne "deification cor^orises a 



>/ay evolved bv rec-rs: 



'e sequence reco;T;bi na t i on 



A t'^rther aspect of the invention is a method of 
optimizing expression of a gene product, the method 
comprising : 

(1) recombining at least first and second DNA 

segments from at least one aene corf->->-^r- shil^r^. ^ ■ 

^.ir.^ aoiiity to proauce 

the gene oroduct, the s^cm^-^^s r-r^^^.-^^ 

' ^ c...e.ing :,rom eacn other m at 

least two nucleotides, to orcd^.-^ a T-:hv--,r-./ r,- >-^^^ ' * 

ci ^^D*ar/ o^ recomjDinant 

genes ; 

(2) screening at least one recombinant gene from 
the library that confers optimized expression of the gene 
product relative to a wildtNnDe form, of the gene; 



.'-■scccio: <v;o gr^SiCaAu 
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(3) recorobining at least a segment from the at 
least one recombinant gene with a further DNA segment from the 
at least one gene, the same or different from the first and 
second segments, to produce a further library of recombinant 

5 genes; 

(4) screening at least one further recombinant 
gene from the further library of recombinant genes that 
confers optimized ability to produce the gene product relative 
to a previous recombinant gene; 

10 (5) repeating (3) and (4), as necessary, until the 

further recombinant gene confers a desired level of optimized 
ability to express the gene product. 

A further aspect of the invention is a method of 
evolving a biosensor for a compound A of interest, the method 

1 5 ' compr is ing : 

(1) recorriDinir.g at lease first and second DNA 
segments from at least one cer.e conferring ability to detect a 
related -comoound B, the segrier.ts differing from each other in 
at least two nucleotides, to produce a library of recombinant 

2 0 genes; 

(2) screening at least one recombinant gene from 
the library that confers optimized ability to detect compound 
A relative to a wilct\pe form of the gene; 

(3) recoTirining at least a segment from the at 

2 5 least one recombinant cer.e wich a further DXA segment from the 
at least one gene, the saT:e or different from the first and 
second secments, to produce a further library of recomjDinant 
genes ; 

(4) screening at least one further recorr±>inant 
30 gene from the further library of recombinant genes that 

confers optimiized ability to decect com.pound A relative to a 
previous recomJoinant gene; 

(5) repeating \2) and (4), as necessary, until the 
further recombinant gene confers a desired level of optimized 

35 ability to detect ccTxOO'Jind A. 



•.SCCCO <V/0 9 735966 At > 



wo 97/35966 



PCT/US97/04715 



Brief D escriptiion of the Drawina.g 
Fig. 1 is a drawing depicting a scheme for in vitro 
recursive sequence recombination. 

Description of the Specific Embodimpn^Q 
The invention provides a number of strategies for 
evolving metabolic and bioprocessing pathways through the 
technique of recursive sequence recombination. One strategy 
entails evolving genes that confer the ability to use a 
particular substrate of interest as a nutrient source in one 

-- ^ ci. 1. j.^iciii. use ol tnat substrate 

in that species, or comparable or more efficient use of chat 
substrate in a second species. Another strategy entails 
evolving genes that confer the ability to detoxify a compound 
of interest in one or r.ore species of organisms. Another 
strategy entails evolving new -etabolic pathways by evolving 
an enzyme or metabolic pathway for biosynthesis or' degradation 
of a compound A related to a cor.oound B for the ability to 
biosynthesize or degrade ccxpour.d B, either in the host of 
origin or a new host. A further strategy entails evolving a 
cene or metabolic pathway for ~::re efficient or optimized' 
expression of a particular metabolite or gene product. A 
rurther strategy entails evol-'ir.g a host/vector system for 
e:-:pres3ion of a desired heterologous product. These 
strategies may involve using ail t.he genes m a m.ulti-step 
pathway, one or several genes, cenes from different organisms, 
or one or more fragme.-.ts of a aene . 

The strategies generally entail evolution of 
gene(s) or segment (s) thereof to allow retention of function 
in a heterologous cell or im.provement of function in a 
ho-.ologous or heterologous cell. Evolution is effected 
generally by a process term-d recursive sequence 
recombination. Recursive sequence recom.bination can be 
achieved in m.any different f::rm3ts and permutations of 
formats, as described m further detail below. These formats 
share some com.mon principles. .Recursive sequence 
recom.bination entails successive cycles of recombination to 
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generate molecular diversity, i.e., the creation of a faniiy 
of nucleic acid molecules showing substantial sequence 
identity to each other but differing in the presence of 
mutations. Each recombination cycle is followed by at least 
5 one cycle of screening or selection for molecules having a 

desired characteristic. The molecule (s) selected in one round 
form the starting materials for generating diversity in the 
next round. In any given cycle, recombination can occur in 
vivo or in vicro. Furthermore, diversity resulting from 
10 recombination can be augmented in any cycle by applying prior 
methods of mutagenesis (e.g., error-prone PGR or cassette 
mutagenesis, passage through bacterial mutator strains, 
treatment with chemical mutagens) to either the substrates for 
or products of recorr>)ina t ion . 

IB ■ 

I . Formats for Recursive Secrjence Recombination 

Some formats and ex::~ples for recursive sequence 

recombination, sometimes referi'od to as DNA shuffling or 

molecular breeding, have beer, described by the present 
20 inventors and co-workers in cc-pending applications, U.S. 

Patent Application Serial N'c . 15/621,430, filed March 25, 

1996; Serial N'o . PCT/US 9 5 / 0 2 1 2 6 , filed February 17, 19 95; 

Serial'No. 08/521,55?, filed Xarch-25, 1995; Serial No. 

03/195,431, filed February 17, 1554; Stemmer, Science 
25 270:1510 (1995); Ster.:r.or et al . , Gene 164:49-53 (1995); 

SteTimer, 5 i o /Techno 1 c"'-- 13:549 -553 ( 1955); Stemmer, Proc . 

Natl. Acad. Sci. U.S.A . 51:1074 7 - 10751 (1994); Stem.ner, Nature. 

370:389-391 (1994); Crarr.eri et al . Nature Medic ine 2(1): 1-3 

(1996); Crameri et al . N'ature r iotechnolccv 14:315-319 (1956), 
30 each of which is incorporated by reference in its entirety for 

all purposes . 

( 1 ) In Vitro ?orT:-.T:s 

One format for recursive sequence recomJoina t ion in 
35 vitro is illustrated in Tig. :. The initial substrates for 
recombination are a pool of related sequences. The X's in 
Fig. 1, panel A, show where the sequences diverge. The 
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sequences can be DNA or RNA and can be of various lengths 
depending on the size of the gene or DNA fragment to be 
recombined or reasseribled . Preferably the sequences are fron 
50 bp to 100 kb. 

The pool of related substrates can be fragmented, 
usually at random, into fragments of from about 5 bp to 5 kb 
or more, as shown in Fig. 1, panel B. Preferably the size of 
the random fragments is from about 10 bp to 1000 bp, more 
preferably the size of the DNA fragments is from about 20 bp 
to 500 bp. The substrates can be digested by a number of 

differ'^r^ m£l^Vl^y^c• r7ii^V, ^.r- n^T^,-^<-.T ^v- nxTn„« o.* ^ • 1 

shearing or restriction enzyme digestion. The concentration 
of nucleic acid fragments of a particular length or sequence 
is often less than 0.1 V cr Ih by weight of the total nucleic 
acid. The nurriDer of different specific nucleic acid fragments 
in the mixture is usually least about 100, 500 or 1000. 

The mixed populacior, of nucleic acid fragments are 
denatured by heading to abz-z 50^ C ,to 100° C, more preferably 
from 90° C to 95° C, to fcrrr. s i r.gle - s tranded nucleic acid 
fragments and then reanr.ealed. Single- stranded nucleic acid 
fragments having regions c: sequence identity with other 
single-stranded nucleic acid fragments can then be reannealed 
by cooling to 20^- C co 75^ C, and preferably from 40° C to 65° 
C. Renaturacicn can be accelerated by the addition of 
polyethylene glycol cr salt. The salt concentration 

IS preferably from C rrJ-: to 500 -y. , more preferably the salt 
concentration is from 10 rr.M to ICO m . The salt may be such 
salts as C-rHJ.SO, , KCl , or N'aCl . The concentration of PEG is 
preferably from 0% to 20^, more preferably from 5V to 10^^. 
Tne fragm.ents that reanneal can oe from, different substrates 
as sho'A'n in Fig. 1, panel C. 

The annealed nucleic acid fragments are incubated 
in the presence of a nucleic acid polymerase, such as Taq or 
:<lenov;, and d.NT?'s (i.e. cAT?. cCT?, dGT? and dTTP) . If 
regions of sequence identity are large, Taq or other high- 
temperature polymerase can be used with an annealing 
temperature of- between 45-55°C. If the areas of identity are 
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- small, Klenow or othef low- temperature polymerases can be used 
with an annealing temperature of between 20-30'=»C. The 
polymerase can be added to the random nucleic acid fragments 
prior to annealing, simultaneously with annealing or after 
annealing . 

The cycle of denaturation, renaturation and 
incubation of random nucleic acid fragments in the presence of 
polymerase is sometimes referred to as "shuffling" of the 
nucleic acid in vicro. This cycle is .repeated for a desired 
nuaiber of times. Preferably the cycle is repeated from 2 to 
100 times, more preferably the sequence is repeated from 10 to 
40 times. The resulting nucleic acids are a family of double- 
stranded polynucleotides of from about 50 bp to about 100 kb. 
preferably from 500 bp to 50 kb. as shown in Fig. i, panel D. 
The population represents variants of che smarting substrates 
showing substantial sequence identity thereto but also 
diverging at several positions. The population has many more 
mer?iers than the starting substrates. The population of 
fragments resulting frox recc~jDina t ion is preferably first 
a-plified by ?CR, then cloned into an appropriate vector and 
the ligation r.ixture used to transform! host cells: 

In a variation of in vitro shuffling, subsecuences 
of recombination substrates can be generated by amplifying the 
full-length sequences under conditions which produce a 
sucstantiai traction, tyri-ically a: least 20 percent or more, 
of incompletely extended arr.pl i f i c a t ion products. The 
amplification products, including the incompletely extended 
a-plif ication products are denatured and subjected to at least 
one additional cycle of reannealinc and amplification. This 
variation, wherein at least one cycle of reannealing and 
amplification provides a substantial fraction of incompletely 
extended products, is terr.ed "stuttering." In the subsequent 
ar.plif ication round, the inccr.pletely extended oroducts anneal 
to and prime extension on differe.nt sequence - re la ted template 
species . 

In a further variation, at least one cycle of 
amplification can be conducted using a collection of 
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overlapping single - stranded DNA fragments of .related seouence, 
and different lengths. Each fragment can hybridize to and 
prime polynucleotide chain extension of a second fragment fro:, 
the collection, thus forming sequence-recombined 
polynucleotides. In a further variation, single-stranded DNA 
fragments of variable length can be generated from a single 
primer by Vent DNA polymerase on a first DNA template. Th- 
single stranded D.MA fragments are used as primers for a 
second, Kunkel-type template, consisting of a 

uracil-containing circular sirigle-stranded DNA. This results 
in multiple substitutions of the first template into the 
second {see Levichkin ec al . Mol . Biology 29:572-577 (1995)). 

Gene clusters such as those involved in polyketide 
synthesis (or indeed any -ulti -enzyme pathways catalyzinc 
analogous metabolic reactio.^.s) can be recombined by recursive 
sequence recombination even if -nsy lack DNA sequence 
homology. Homology ca.-. be introduced using synthetic 
oligonucleotides as ?CR prirr.ers. In addition'to the soeci-c ' 
sequences for the gene cem:: a.-plified, all of the orimers 
used to amplify one t>T>e of en2:.~,e (for e.xample the acyl 



carrier protei.n in oo 



svr. tne 



sis) are synthesized to 



contain an additio.nal sequence o: 20-40 bases 5' to the gene 
(sequence A) and a' different 20-40 base seque.nce 3' to th° 
gene (secue.nce 3) . The acjacenc gene (m this case th^ >.co. 
s>-thase) is amplified using a 5' primer which contains the 

ccxp 1 e rr.e n t a rv s 



, a. . . 



sequence B (ssq^uence B'), and a 3 



primer containing a difreren- 20--;0 base sequence (C) . 
Similarly, primers for the .ne.xt adjacent gene (keto- 
reductases) contain sequences C (complementary to C) and D. 
If 5 different polyketide gene clusters are beinc shuffled 
all five acyl carrier proteins are flanked by sequences A and 
= following their ?CR amplification. In this way, small 
regions of hom.ology are introduced, m.aking the gene clusters 
into site-specific recomi^ina t ion cassettes. Subsecuent to the 
initial amplification of individual genes, the am.olified genes 
can then be mixed and subDected to primerisss PC?.. Seau-nc- B 
a^. the 3' end of all of the five acyl carrier protein genes 
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can anneal with and prime DNA synthesis from sequence B' at 
the 5' end of all five keto reductase genes. In this way all 
possible combinations of genes within the cluster can be 
obtained. Oligonucleotides allow such recombinants to be 
obtained in the absence of sufficient sequence homology for 
recursive sequence recombination described above. Only 
homology of function is required to produce functional gene 
clusters . 

This method is also useful for exploring 
permutations of any other rr.ult i -subunit enzymes. An example 
of such enzymes composed of multiple polypeptides that have 
shown novel functions when the subunits are combined in novel 
ways are dioxygenases . Directed recombination between the 
four protein subunits of biphenyl and toluene dioxygenases 
produced functional dioxygenases with increased activity 
against t richloroechy lene (Furukawa et . al. J. Bacteriol. US 
2121-2123 (1994)). This ccTJoination of subunits from the two 
dioxygenases could also have been produced by cassette - 
shuffling of the dioxygenases as described above, followed by 
selection for degradation of : r ichloroethylene . 

In some poiyketide synthases, the separate 
functions of the acyl carrier protein, keto- synthase , keto- 
reductase, etc. reside in a siKgle polypeptide. In these 
cases domains within -he single polypeptide may be shuffled, 
even if sufficient hcroiogy does not exist naturally, by 
introducing recions of ho"oiogy as described above for entire 
Genes. In this case, it may not be possible to introduce 
additional flanking sequences to the domains, due to the 
constraint of maintaining a continuous open reading frame. 
Instead, groups of oligonucleotides are synthesized that are 
homologous to the 3' end of the firsc domain encoded by one o 
the genes to be shuffled, and the 5' ends of the second 
domains encoded by all of the ccher genes to be shuffled 
together. This is repeated -.sith all domains, thus providing 
sequences that allow recombination between protein domains 
while miaintaining their order. 
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10 



20 



The cassetce-based recombination mechod can be 
combined with recursive sequence recombination by including 
gene fragments (generated by DNase, physical shearing, DNA 
stuttering, etc.) for one or more of the genes. Thus,- in 
addition to different combinations of entire genes within a 
cluster (e.g., for polyketide synthesis), individual genes can 
be shuffled at the same time (e.g., all acyl carrier protein 
genes can also be provided as fragmented DNA) , allowing a more 
thorough search of sequence space. 



(2 ) In Vivo Forma 

Plasmid -Plasmid Reconr±)i nation 
The initial substrates for recombination are a 
collection of polynucleotides co~prising variant forms of a 
15 gene. The varia.-t forms usually show substantial sequence 
ice.ntity to each other suff — ler.t to allow homologous 
recombination between subs-.r2-.es. The diversity between the 
poly.nucleotides can be narur^l (e.g., allelic or soecies 
variants), induced ie.c.. c-.-.-cr- prone PC.Ror error-prone 
recursive sequence reccr-i;-.,: ,0." ) . or the result of in vitro 
recombination. Diversity i.n also result from resynthesi.inc 
55r.es encoding natural proteins with alternative codon usaae. 
Tnere should be at least sufficient diversity between 
^'■^^^'-'^^-^ re = o-:.i.-.stio.-^. car. generate more diverse 

25 products than there ar; starting materials. There must be at 
least tv.-o substrates -ff-^ring ir, at least two positions. 

' -=~~or.iy a licrary of substrates of 10^-10' members is 
employed. The degree cf diversity depends on the length of 
the substrate being recomoir.ed and the extent of the 
30 functional change to be evolved. Diversity at between 0.1-25'. 
o: positions is typical. The diverse substrates are 
incorporated into plas.mids. The plasmids are often standard 
cloning vectors, e.g., bacterial multicopy plasmids. However, 
in som,e methods to be desciMed below, the plasmids include 
35 m.obilization (HOB) functic.-:,?. The substrates can be 

incorporated into the same or different plasmids. Often at 
least two different tvn^es of piasmid having different types of 
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-selectable markers are~ used to allow selection for cells 
containing at least two types of vector. Also, where 
different types of plasmid are employed, the different 
plasmids can come from two distinct incompatibility groups to 
allow stable co-existence of two different plasmids within the 
cell. Nevertheless, plasmids from the same incompatibility 
group can still co-exist within the same cell for sufficient 
time to allow homologous recombination to occur. 

Plasmids containing diverse substrates are 
initially introduced into cells by any method (e.g., chemical 
transformation, natural competence, electroporation, 
biolistics, packaging into phage or viral systems). Often, 
the plasmids are present at or near saturating concentration 
(with respect to maximum transfection capacity) to increase 
the probability of more than one plasmid e.ntering the same 
cell. The plasmids containing the various substrates can be 
tra.nsfected simultaneously or ir. multiple rounds. For 
example, in the latter approach cells can be transfected with 
a first aliquot of plasmid, t rBr.5 f ectant s selected and 
propagated, and then infected wi:h a second aliquot of 
plasmid. 

Having intrcd'jced the plasmids into cells, 
recorrjoina t ion between substraces to generate recorrJoinant genes 
occurs within cells containing multiple different plasmids 
cerelv bv orooacatinn the cells. However, cells that receive 
cni\' one pi a sr. id are unable to participate in reccrri)i nation 
and the potential contribution o: substrates on such plasmids 
to evolution is not fully exploited (although these plasmids 
may contribute to some extent if they are progagated in 
mutator cells) . The rate of evolution can be increased by 
allowing all substrates to participate in recombination. Such 
can be achieved by subjecting transfected cells to 
electroporation. The conditions for electroporation are the 
same as those conventionally v:sed for introducing exogenous 
Dy.h into cells (e.g., 1,000-2,500 volts, 400 pF and a 1-2 mM 
gap). Under these conditions, plasmids are exchanged between 
cells allowing all substrates to participate in recombination. 



wo 97/35966 



PCT/US97/04715 



15 



In addition the products of recombination can undergo further 
rounds of recombination with each other or with the original 
substrate. The rate of evolution can also be increased by use 
of conjugative transfer. To exploit conjugative transfer, 
substrates can be cloned into plasmids having MOB genes, and 
tra genes are also provided in cis or in trans co the MOB 
genes. The effect of conjugative transfer is very similar to 
electroporation in that it allows plasmids to move between 
cells and allows recombination between any substrate and the 
products of previous recombination to occur, merely by 
propagating the culture. The rate of evolution can also be 
increased by fusing cells to induce exchange of plasmids or 
chromosomes. Fusio.n can be induced by chemical agents, such 
as PEG, or viral proteins, such as influenza virus 
15 hemagglutinin, HSV-1 g3 and gD . The rate of evolution can 

also be increased by use of nutator host cells (e.g., Mut L, ' 
S, D, T, :-: in bacteria and .-.ta.via telangiectasia human cell 
lines ) . 



10 



20 



35 



The time to. whicn cells are propagated and 
recombination is allowed to cccur, of course, varies with the 
cell type but is generally not critical, because even a small 
decree of recombination can substantially increase diversity 
relative to the starting materials. Cells bearing plasmids 
containing reccrr.bined genes are subject to screening or 
25 selection for a desired function. For exarr.ple, if the 

substrate being evolved contains a drug resistance gene, one 
would select for drug resistance. Cells surviving screening 
or selection can be subjected to one or more rounds of 
screening/selection followed by recombination or can be 
30 subjected directly to an additional round of recombination. 

The next round of reccroinat ion can be achieved by 
several different formats i.ndependent ly of the previous round. 
For example, a further round of recoribination can be effected 
simply by resuming the electroporation or con]ugation-mediated 
intercellular transfer of oiasmids described above. 
Alternatively, a fresh substrate or substrates, the same or 
different from previous substrates, can be transfected into 
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cells surviving selection/screening. Optionally, the new 
substrates are included in plasmid vectors bearing a different 
selective marker and/or from a different incompatibility group 
than the original plasmids. As a further alternative, cells 
5 surviving selection/screening can be subdivided into two 
subpopulat ions , and plasmid DMA from one subpopulat ion 
transfected into the other, where the substrates from the 
plasmids from the two subpopulat ions undergo a further round 
of recombination. In either of the latter two options, the 

10 rate of evolution can be increased by employing DNA 

extraction, electroporat ion , conjugation or mutator cells, as 
described above. In a still further variation, DNA from cells 
surviving screening/selection can be extracted and subjected 
to in vitro recursive sequence recorrJoinat ion . 

15' After the second round of recombination, a second 

round of screening/selection is performed, preferably under 
conditions of increased stringency. If desired, further 
rounds of recorabina t ion and selection/screening can be 
performed using the same scracegy as for the second round. 

20 With successive rou.nds of reccTjirinacion and 

selection/screening, che surviving recombined substrates 
evolve toward acquisition of a desired phenot;/pe . Typically, 
in this and other methods of recursive recombination, the 
final product of reccrJ^i nac ion that has acquired the desired 

2 5 phenotyoe differs frc<:r starting substrates at 0.1%-25ir of 
positions and has evolved at a rate orders of magnitude in 
excess (e.g., by at least IC-fold, lOO-fold, 1000-foid, cr 
10,000 fold) of the rate of naturally acquired mutation of 
about 1 mutation per 10'^ positions per generation (see 

30 ;^Jiderson et al . Proc. Natl. Acad. 5ci . U.S.A . 93:906-907 

(1996)). The "final product" may be transferred to another 
host more desirable for utilization of the "shuffled" DNA. 
This is particularly advantageous in situations where the more 
desirable host is less efficient as a host for the many cycles 

35 of mutation/recombination due to the lack of molecular biology 
or genetic tools available for other organism.s such as E. 
coli . 
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Vinis-Pla smid Rprotnbinatiinn 
The strategy used for plasmid-plasmid recombination 
can also be used for virus -plasmid recombination; usually, 
phage-plasmid recombination. However, some additional 
5 comments particular to the use of viruses are aporopriate 

The initial substrates for recombination are cloned into both 
plasmid and viral vectors. it is usually not critical which 
substrate (s) are inserted into the viral vector and which into 
the plasmid, although usually the viral vector should contain 
10 Different substrate (s) from the plasmid. As before the 

plasmid (and the virus) typically contains a selective marke>- 
me plasmid and viral vectors can both be introduced into 
cells by transfection as described above. However, a mo>-e 
efficient procedure is to transfect t.he cells with clasmid, 
1:^ select trans fectants and infect the trans fectants with virus. 

Because the f^ff I'c'pnnv ^ 

-.E-c.ency ....ec.ion of many viruses aooroach-s 

100% of cells, nost cells tra..sfected and infected by t- s 
route contain both a pias..ic a.- virus bearing different 

s'jbscraces . 

recoT.bi.-.ation occurs between plasxid and 

Virus Q'^n^Ta.t. ^ h.n r H o ^ - ~ ^ ' ^ ' > 

Virus. Fcr some viruses cr'-.-n ^ ^ } -.^^ ^ 

-^ui._b, s^_n as ^liamer.tious phage., in which 

1.-. cotn couble-stranded a.-.d single- 
^c. cci.. = icipate m recca-Joination . 
2d Provided that tb° -i-c „„„ ^ . 

one tnat rapidly kills cells, 

recorr-^ir.atio.. can ce a.grr.ented by use of e iect rooorat ion or 
conjugation to tra.nsfer plas.ids between cells. Reco,-nbinat ion 
can also be augmented fo. so.e t>^es of virus by allowing th^ 



30 



35 



progeny virus from o.ne cell to reinfect ot.her cells. For som^ 
t-^es of virus, virus infected-cells show resistance to 
superinfection. However, such resistance can be overcome by 
erecting at high multiplicity a.nd/or using mutant strains of 
t.ae virus in which resista.nce to superinfection is reduced 
The result of infecting piasrrad - containing ce^s 

with virus depends on the natu-^ of th« vi -n^ 

■ _ Ui. cne virus. Some vi>-uses 

such as filamentous phage, staoly exist with a olasmid in th^ 
cell and also extrude progeny phage from the cell. other 
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Viruses, such as lambda having a cosmid genome, stably exist 
in a cell like plasmids without producing progeny virions. 
Other viruses, such as the T-phage and lytic lambda, undergo 
recombination with the plasmid but ultimately kill the host 
cell and destroy plasmid DNA. For viruses that infect cells 
without killing the host, cells containing recombinant 
plasmids and virus can be screened/selected using the same 
approach as for plasmid-plasmid recombination. Progeny virus 
extruded by cells surviving selection/screening can also be 
collected and used as substrates in subsequent rounds of 
recombination. For viruses that kill their host cells, 
recombinant genes resulting from recombination reside only in 
the progeny virus. If the screening or selective assay 
requires expression of recorjDinant genes in a cell, the 
recombinant genes should be cransferred from the progeny virus 
to another vector, e.g., a vlasrrrLd vector, and retransf ected 
into cells before selection/screening is performed. 

For filamentous pr.a-ce, the products of 
recombination are present ir. i^-tr. cells surviving 
recombination and in phage extruded from these cells. The 
dual source of recorriDinar. t products provides some additional 
options relative to the plasrr.id-plasmid recombination. For 
example, DNA can be isolated frox phage particles for use in a 
round of in vicro re ccr:-: i n a t ion . Alternatively, the progeny 
phage ca,n be used to tr 
previous rou.nd of scre^ 

transfected with fresh substrates for recombination 



:sfect or infect cells surviving a 
.r.c / selection, or fresh cells 



(c) Virus-Virus Recombination 
The principles described for plasmid-plasmid and 
plasmid- viral recombination can he applied to virus-virus 
recorrj^ination with a few r.ch i f i cat ions . The initial 
substrates for recombina t ic:. are cloned into a viral vector. 
Usually, the same vector is u^ec for all substrates. 
Preferably, the virus is cr.e that, naturally or as a result of 
mutation, does not kill cells. After insertion, some viral 
genomes can be packaged in vi trc or using a packaging cell 
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line. The packaged viruses are used Co infect cells at high 
multiplicity such that there is a high probability that a cell 
will receive multiple viruses bearing different substrates. 

After the initial round of infection, subsequent 
steps depend on the nature of infection as discussed in the 
previous section. For example, if the viruses have ohagemid 
genomes such as lambda cosmids or M13, Fl or Fd phagemids, the 
phagemids behave as plasmids within the cell and undergo 
recombination simply by propagating the cells. Recombination 
is particularly efficient between single-stranded forms of 
intracellular DNA. Recombinacion can be augmented by 
electroporation of cells. 

Following selection/screening, cosmids containing 
recombinant genes can be recovered from surviving cells, e.g., 
by heat i.nduction of a cos' lysogenic host cell, or extraction 
or DNA by standard procedures, followed by repackaging cosmic' 
DiN'A in vitro. 

If tne viruses are filamentous phage, recombination 
of replicating form DNA occurs by propagating the culture of 
infected cells. Selection/s zree.-^.ing identifies colonies of '.' 
ceils concaini.-g viral vecrcrs having recombinant genes with . 
improved properties, together v.-th phage extruded from such 
cells. Subsequent optio.-.s are essentially the same as for 
p 1 asrr.id - vi ra 1 recombination 

Chronoso -e Recorrubina t: ion 

inis zor-az can be 'osed to especially evolve 

chromosomal substrates. The format is particularly useful in 

situations in which many chror.Dsomal genes contribute to a 

phenotype or one does r.oz know the exact location of th^ 

chrom.osomal gene(s) to be evolved. The initial substrates for 

recomoination are cloned into a plasmid vector. If the 

chromosom.al gene(s) to be evolved are' known, the substrates 

constitute a family of sequences showing a high degree of 

sequence identity but some c-----o^nr- f-or, 

_-._-^_..c^ rrom the cnromoso.mal 

gene. if the chromosomal ger.es to be evolved have not been 
located, the initial substrates usually constitute a library 
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"of DMA segments of which only a small nurrJoer show sequence 
identity to the gene or gene(s) to be evolved. Divergence 
between plasmid-borne substrate and the chromosomal gene(s) 
can be induced by mutagenesis or by obtaining the plasmid- 
5 borne substrates from a different species than that of the 
cells bearing the chromosome. 

The plasmids bearing substrates for recombination 
are transfected into cells having chromosomal gene(s) to be 
evolved. Evolution can occur simply by propagating the 

10 culture, and can be accelerated by transferring plasmids 

between cells by conjugation or electroporat ion . Evolution 
can be further accelerated by use of mutator host ceils or by 
seeding a culture of nonrnucator host cells being evolved with 
rr.utator host cells and inducing intercellular transfer of 

15" plasmids by e lectropora t ion or conjugation. Preferably, 
mutator host cells used for seeding contain a negative 
selectable marker to faciiica.e isolation of a pure culture of 
the nonT.utator cells being evolved. Selection/screening 
identifies cells bearing chrorr.osorr.es and/or plasmids that have 

20 evolved toward acquisition of a desired function. 

Subsequent rounds of recorrJoina t ion and ' 
selection/screening oroceed in sirr.ilar fashion to those 
described for plasrr.id - plasnr.ic recorabinat ion . For example, 
further recorribi nat i on can be effected by propagating cells 

2 5 survivinc reccrrJoina t icn in corr.bina:: ion wich e lectropora t ion or 
conjugative transfer of plasrr.ids. Alternatively, plasmids 
bearing additional substrates for recorrjiDina t ion can be 
introduced into the surviving cells. Preferably, such 
plasmids are from a different incompatibility group and bear a 

30 different selective marker than the original plasmids to allow 
selection for cells containing at least two different 
plasmids. As a further alternative, plasmid and/or 
chromosomal DNA can be isolated from a subpopulat ion of 
survivina cells and transfected into a second subpopulat ion . 

35 Chromosomal DNA can be cloned into a plasmid vector before 
transiect ion . 
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(e) v'irus- Chromosome Recombination 
As in the other methods described above, the virus 
is usually one that does not kill the cells, and is often a 
phage or phagemid. The procedure is substantially the same as 
for piasmid-chromosome recombination. Substrates for 
recombination are cloned into the vector. Vectors including 
the substrates can then be transfected into cells or in vicro 
packaged and introduced into cells by infection. Viral 
genomes recombine with host chromosomes merely by propagating 
a culture. Evolution can be accelerated by allowing 
intercellular transfer of viral genomes by electroporation, or 
reinfection of cells by progeny virions. Screening/selection 
identifies cells having chroT.osomes and/or viral genomes tha^ 
have evolved toward acquisition of a desired function. 

There are several opcions for subsequent rounds of 
recorr.binacion . For exa-ple. viral genomes can be transferred 
between cells sur\^ivinc se i-a c: icn/recombina t ion by 
elecrroporat ion . Al terna " i veiy , viruses extruded from cells 
surviving select ic-./screening can be pooled and used to 
20 superinfect the cells at high "ui c ipl ici t y . Alternatively, 

rresh subscrates for recc.-?.bir.a t ion can be introduced into the 
cells, either on oiasr.id or viral vectors, 

- ^ • Recursi ve Secr^en-e Recombination Techniques for Metabolic 
and Cellular Encmeering 

A. . Startinc Materia}. s 

Thus, a general method for recursive sequence 
recorjDina tion for the erJocdirr.en t s herein is to begin with a 
gene encoding an enzyr.e or enzyr.e subunit and to evolve that 

30 gene either for ability to act on a new substrate, or for 

enhanced catalytic properties with an old substrate, either 
alone or in combination with other genes in a multistep 
pathway. The tern "gene" is used herein broadly to refer co 
any segment or sequence of DN'A associated with a biological 

35 iiinction. Genes can be obtained from a variety of sources, 
including cloning fron^. a source of interest or synthesizina 
from known or predicted sequence information, and may include 
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'sequences designed to have desired parameters. The ability to 
use a new substrate can be assayed in some instances by the 
ability to grow on a substrate as a nutrient source. In other 
circumstances such ability can be assayed by decreased 
toxicity of a substrate for a host cell, hence allowing the 
host to grow in the presence of that substrate. Biosynthesis 
of new compounds, such as antibiotics, can be assayed 
similarly by growth of an indicator "organism in the presence 
of the host expressing the evolved genes. For example, when 
an indicator organism used in an overlay of the host 
expressing the evolved gene is) , wherein the indicator organism 
is sensitive or expected to be sensitive to the desired 
aniibiocic, growth of the indicator organism would be 
inhibited in a zone around the host cell or colony expressing 
the evolved gene(s) . 

Another method of idencifying new compounds is the 
use of standard analytical techniques such as mass 
spectroscopy, nuclear mag.net ic resonance, high performance 
liquid chromatography, etc. Recc~binant microorganisms can be 
pooled and extracts or media supernatants assayed from these 
pools. Any positive pool can then be subdivided and the 
procedure repeated until the single positive is identified 
( "sib- selection" ) . 

In some instances, the smarting material for 
recursive sequence reccmbinat icn is a discrete gene, cluster 
of genes, or family of genes known or thought to be associated 
v.'ich metabolism of a particular class of substrates. 

One of the advantages of the instant invention is 
that: structural information is noc required to estimate which 
parts of a sequence should be mutated to produce a functional 
hybrid enzyme . 

In some erribodiment s of the invention, an initial 
screening of enzyme activities in a particular assay can be 
useful in identifying candidate enzymes as starting materials. 
For' example, high throughput screening can be used to screen 
enzymes for dioxygenase - type activities using aromatic acids 
as substrates. Dioxygenases topically transform indole-2- 
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carboxylate and indole - 3 -carboxylate to colored products, 
including indigo (Eaton et . al . J. BactPrinI 177:6983-6988 
(1995)). DNA encoding enzymes that give some activity in th-: 
initial assay can then be recombined by the recursive 
techniques of the invention and rescreened. The use of such 
initial screening for candidate enzymes against a desired 
target molecule or analog of the target molecule can be 
especially useful to generate enzymes that catalyze reactions 
of interest such as catabolism of man-made pollutants. 

This type of high throughput screening can also be 
used during each round of recursive sequence recombination to 
Identify mutants that possess the highest level of the desire: 
activity. For example, penicillin G acylases have been 
isolated by loolcing for clo..es chat allow a leucine auxocrooh 
CO hydrolyse penicil li.-^. G a.-.alogue phenylacecyl -L- leucine , 
thereby producing leucine ar.z allowing cell growth (Marti.n. L. 

Microbiol pc:v : 125:287- 292 (1995)). 

Positives fron this select :r.-. are then screened bv a more 

.1^^.15. v_ n^^r.oG .:di:y to hyarolyse penicillin G. 

This sane Seie'^t''^" ^-^^ -^hci'^^y*^r■^-^/^ t ^ • * 

-^^^^ pneny^aceLyl -u-ieucine can 

illin G acylase for greater 
aciivicy by recursive sequence r e cc^ma c ion , After each ' 
round of recorr^bmauion zr.e library of acylase genes is 
cransforr^ed into a leucine auxo-oph. Tnose that grow fastest 

are picked as crcb^.h'-/ rr^^^ir^^ . 

awuj^ve acylase. The 

acylases are v-i - - -.^^ ^ _ „ „ . ^ • , 

^_ a'^d^r.^., tne real substrate, 

penicillin G, by a rore laborious screen such as HPLC. Thus 

even if there is no convenient hign throughput screen for an' 

enzyrne or a metabolic pathway, it is often possible to find a 

rapid detection rrethod that can approxir^a te ly measure the 

desired Dhenotvxie, ch'=>-^p'T' v-o-.. ^ ; ^v,^ „, u ^ , 

•-jij-, i-i._.eD. .e.^„^^.._ ^ne numbers of colonies 

that must be screened more acruratelv 

The starting -at-irial ca." also be a segment of such 
a gene or cluster that is rorr-.omed m isolation of ^ ts 
surrounding DNA, but is relir..-.-- to its surrounding DNA b=>fo^^ 
screening/selection of recc:;-,bi.-.a t ion products. in other 
instances, the starting material for recombination is a larg->- 
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segment of DNA that includes a coding sequence or other locus 
associated with metabolism of a particular substrate at an 
unknown location. For example, the starting material can be a 
chromosome, episome, YAC, cosmid, or phage PI clone. In still 
other instances, the starting material is the whole genome of 
an organism that is known to have desirable metabolic 
properties, but for which no information localizing the genes 
associated with these characteristics is available. 

In general any type of cells can be used as a 
recipient of evolved genes. Cells of particular interest 
include many bacterial cell types, both gram-negative and 
gram-positive, such as Rhodococcus , StrepComycetes , 
Accinomyceces , Corynehacceria , Penicillium, Bacillus, 
Escherichia coli. Pseudononas , Salmonella, and Erwinia, Cells 
of interest also include eukaryotic cells, particularly 
mammalian cells (e.g., mouse, ha-szer , primate, human), both 
cell lines and primary cultures. Such cells include seem 
cells, including eT±;ryonic see- cells, zygotes, fibroblasts, 
lymphocytes, Chinese hamster ovary (CHO) , rr.ouse fibroblasts 
(NIH3T3), kidney, liver, xuscle, and skin cells. Other 
eukaryotic cells of ir.::erest include plant cells, such as 
maize, rice, wheat, cotter., soybean, sugarcane, tobacco, and 
arabidopsis; fish, algae, fungi ( Peni ci i i i ur., fusariu-^, 
Aspergillus, Podospora , ::eurospo:ra) , insects, yeasts iPicchia 
and Saccharc:uyces) . 

The choice of host will depend cn a number of 
factors, depending on the intended use of the engineered host, 
including pathogenicity, substrace range, environmental 
hardiness, presence of key intermediates, ease of genetic 
manipulation, and likelihcod cf promiscuous transfer of 
genetic information to other organisms. Particularly 
advantageous hosts are E. coli. laccobacil 1 1 , Screptomycezes , 
Actinorr>yceces and f i lainenzo'js :ungi. 

The breeding prccec'^re starts with at least two 
substrates, which Generally show substantial sequence identity 
to each other (i.e., at least about 50%, 70%, 80% or 90% 
sequence identity) but differ from each other at certain 
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positions. The difference can be any type of mutation, for 
example, substitutions, insertions and deletions. Often, 
different segments differ from each other in perhaps 5-20 
positions. For recombination to generate increased diversity 
relative to the starting materials, the starting materials 
must differ from each other in at least two nucleotide 
positions. That is, if there are only two substrates, there 
should be at least two divergent positions. If there are 
three substrates, for example, one substrate can differ'from 
the second as a single position, and the second can differ 
from the third at a different single position. The starting 
DNA segments can be natural variants of each other, for 
example, allelic or species variants. The segments can also 
be from nonallelic genes shoving some degree of structural and 
usually functional relatedness (e.g., different genes within a 
superfamily such as the inrr-.nogiobulin superfamily) . The - 
starting D.\A segments can also be induced variants of each 
other. For example, o.-^e DNV. segr^enc can be produced bv error- 
prone ?CR replication of the other, or by subst itut ion' of a., 
mutagenic cassette. Induced mutants can also be oreoared bv 
prcpagati.ng one (or both) of che segments in a mutagenic 
strain. ir. these si-ua-.ions, strictly speaking, the second 



segr^.ent is r.cz a si.ngle secT.ent but a large family of 

rela-ed segments. The diff--'-'=n- 30—^^^=: '^o^---;^- 

cs i.ori.,i.-ig Che startmc 

rr.aterials are cfc = ." s-~'-= ■. o---'-, ^..u ^ ■ 

SUDS t an c la 1 Iv the same 

ie.-.cch. Hov.'ever, this need not be :he case; for examole; on- 
segment can be a subsequence of another. The segments can be 

as vectors, or can 



present as part of larger molecules su 



2e m isolated form. 

The starting D.NV^ segments are recombined bv anv of 
t.he recursive sequence reccmbinat io.. form.ats described above 
to generate a diverse library of reco.-nbmant DN'A segments. 
Such a library can vary widely m size from having fewer than 
10 to m.ore than 10^ 1Q\ or -J r-.embers . In general, the 
starting segrr.ents and che recorriinant libraries generated 
include full-length coding sequences and any essential 
regulatory sequences, such as a promoter and polyadenylat ion 



wo 97/35966 



PCT/US97/04715 



26 

sequence, required for expression. However, if this is not 
the case, the recombinant DNA segments in the library can be 
inserted into a common vector providing the missing sequences 
before performing screening/selection. 

If the recursive sequence recombination format 
employed is an in vivo format, the library of recombinant DNA 
segments generated already exists in a cell, which is usuallv 
the cell type in which expression of the enzyme with altered 
substrate specificity is desired. If recursive sequence 
recombination is performed in vitro, the recombinant library 
is preferably introduced into the desired cell type before 
screening/selection. The members of the recombinant library 
can be linked to an episome or virus before introduction or 
can be introduced directly. In some embodiments of the 
invention, the library is ar.plified in a first host, and is 
then recovered from that host and introduced to a second host 
more amenable to expression, selection, or screening, or any 
other desirable parameter. The xanner in which the library is 
introduced into the cell tv^e depends on the DNA-uptake 
characteristics of the cell t\^e , e.g., having viral 
receptors, being capable of conjugation, or being naturally 
co~petent. If the cell tv^e is insusceptible to natural and 
chemical - induced co~petence, but susceptible to 

e lect ropora t ion , one vo-ld usually e.Tiploy e lect ropora t ion . If 
the cell type is insusceptible to e lect roporat ion as well, one 
can employ biolistics. The biolistic PDS-iOOO Gene Gun 
(Siorac, Hercules, CA) uses heliux pressure to accelerate 
DN.-.-coated gold or tungsten microcarr iers toward target cells. 
The process is applicable to a wide range of tissues, 
including plants, bacteria, fungi, algae, intact animal 
tissues, tissue culture cells, and animal emjDryos , One can 
employ electronic pulse delivery, which is essentially a mild 
electroporat ion formac for live tissues in animals and 
patients. Zhao, Advanced Drue Delivery Reviews 17:257-262 
(1995) . Novel m^ethods for r.aking cells competent are described 
in co-pending application U.S. Patent Application Serial No. 
08/621,430, filed March 25, 1995. After introduction of the 
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. library of recombinanc" DNA genes, the cells are opcionally 
propagated to allow expression of genes to occur. 

B • Selecti on and Screening 

Screening is, in general, a two-step process in 
which one first determines which cells do and do not exoress a 
screening marker and then physically separates th- cells 
having the desired property. Selection is a form of screenm. 
xn which identification and physical separation are ach^.ved " 
simultaneously, for exa.T.ple, by expression of a selectable 
marker, which, in some genetic circumstances, allows cells 
expressing the marker to survive while other cells die (o- 
vice versa). Screening markers include, for exaxoie 
luciferase, /3-galactosidase, and green fluorescent oroce^n 
Screening can also be done by observing such asoects o" g^ow-'^ 
as colony size, halo formation, etc. Additionally, sc^e^n-'r^ 
nor production of a desired compound, such as a theraoeutic ' 
drug or "designer chemical" can oe accomplished by obserVna 
binding of cell products to a receptor or ligand, such as^o^ a 
solid support or on a column. Such screening can additionailv 
. -5...^ oy o .o or.ciooaies, as in an ELISA. In. 

some instances the scree-^-c -^^o-oee ^>-^^ -i 

^^.o.ess IS preteraoly automated 

so as to allow screening of suitable numbers of colonies or 

ceils. Some e.xamol^c o- a^-o-a-o- 

■ -- a_.o.,.aue^ screening cevices include ' 

riuorescence ac- -ot-, ^-,^-;„^ . 

" :=--;.^ng, especially in conjunction 

^ -rr.._o.^^,e^ m agarose (see Pow-ii oc a^ 
^i^^^l^chnoloov 8:333-337 ,1990),. Weaver et . al' Mec^ 2:23.- 

(1991)), automated ELISA assays, sci.nti 1 lat ion oroxim^ ^ y 
assays (Hart, H.E. et al . , Molecula^jnm^ 16:2s"5-267 
(1979)) and the formation of fluorescent, coloured or W 
aosorbing cor.pou.nds on agar plates or in microtit^-^ w-ns 

Devel. Inn..- M.cmh.ni..., . 

Selectable markers can include, for exam.oie, drug, tox^- 
resistance, or nutrient synthesis genes. Selection is also 
cone by such techniques as cro.th on a toxic substrate to 
select tor hosts having the abilit/ to detoxify a substrate 
growtn on a new nutrient source to select for hosts having tb^ 
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• ability to utilize that nutrient source, competitive growth in 
culture based on ability to utilize a nutrient source, etc. 

In particular, uncloned but differentially 
expressed proteins (e.g., those induced in response to new 
5 compounds, such as biodegradable pollutants in the medium) can 
be screened by differential display (Appleyard et al . Mol . 
Gen. Gent. 247:338-342 (1995)). Hopwood ( Phil Trans R. Soc . 
Lond B 324:549-562) provides a review of screens for 
antibiotic production. Omura ( Microbio. Rev. 50:259-279 
10 (1986) and Nisbet ( Ann Reo. Med. Chem. 21:149-157 (1986)) 
disclose screens for antimicrobial agents, including 
supersensitive bacteria, detection of )3-lactamase and D,D- 
carboxypeptidase inhibition, /^-lactamase induction, 
chromogenic substrates and rronoclonal antibody screens. 
15' Ancibiocic targets can also be used as screening targets in 

high throughput screening. i fungals are typically screened 

by inhibition of fungal crcwih . Pharmacological agents can be 
identified as enzyrr.e inhibitcrs using plates containing the 
enzyTTie and a chromogenic surszrate, or by automated receptor 
20 assays. Hydrolytic enzv—es ,'e.c., proteases, amylases) can 
be screened by includir.- ihe substrate in an agar plate and 
scoring for a hycrclvtic clear zone or by using a colorimetric 
indicator (Steele et al. An n . '-^e '.' . f- : i c r o b i o 1 . 45:89-106 
(19 31)) . This car. be ru-joled wiih the use of stains to detect 
25 the effects of en7-'.™e aciicn (such as congo red to detect the 
extent of decradaticr. c: cellulcses and hemicel luloses ) . 
Tagged substrates can also be used. For example, lipases and 
esterases can be screened using different lengths of fatty 
acids linked to uobelliferyl. The action of lipases or 
30 esterases removes this tag from the fatty acid, resulting in a 
quenching or enhancement of ur.be 1 1 i feryl fluorescence. These 
enzymes can be screened ir. r.irrctiter plates by a robotic 
device , 

Fluorescence a c e d cell sorting (FACS) methods 
35 are also a powerful tool fr-r -e lect ion/screening . In some 

instances a fluorescent rr.clecule is made within a cell (e.g., 
green fluorescent protein) . The cells producing the protein 
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can simply be sorted by FACS . Gel microdrop technologv allows 
screening of cells encapsulated in agarose microdroos (Weaver 
et al. Methods 2:234-247 (1991)). In this technique products 
secreted by the cell (such as antibodies or antigens) are 
- immobilized with the cell that generated them. Sorting and 
collection of the drops containing the desired product thus 
also collects the cells that made the product, and provides a 
ready source for the cloning of the genes encoding the desired 
functions. Desired products can be detected by incubating the 
encapsulated cells with fluorescent antibodies (Powell ec al, 
Bio/T-echnoloqy 5:353-337 (1990)). FACS sorting can also be 
used by this technique to assay resistance to toxic compounds 
and antibiotics by selecting droplets that contain multiple 
ceils (i.e., the product of continued division in the presence 
of a cytotoxic corr.pou."c ; Goguen et al. Nature 363:189-190 
(1995)). This method can select for any enzy^.e that can 
change the fluorescence of a substrate that can be immobilized 
in the agarose drcpiet. 

In some e.TjccdiTjr.is cf the invention, screenina can 
be accomplished by assay:^-'^ reactivity wich a reporter 
molecule reactive wicn a cesired feature of, f or * examp le , a 
gene product. Thus, specific functionalities such as 
antigenic domains can be screened v.'ith antibodies specific for 
those determinants. 

In other emccciments of the invention, screening is 
preferably done witn a cell-cell indicator assay. In this 
assay format, separate library cells (Cell A, the cell being 
assayed) and reporter cells (Cell B, the assay cell) are used. 
Only one component of the system, the library cells, is 
allowed to evolve. The screening is generally carried out in 
a two-dimensional immobilized format, such as on plates. The 
products of the metabolic pathways er.coded by these genes (in 
this case, usually secondary metabolites such as antibiotics, 
polyketides, carotenoics, etc.; diffuse out of the library 
cell to the reporter cell. Tne product c: the library cell 
may affect the reporter cell in one of a number of ways. 
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The assay system (indicator cell) can have a simple 
readout (e.g., green fluorescent protein, luciferase, /?- 
galactosidase) which is induced by the library cell product 
but which does not affect the library cell. In these examples 
the desired product can be detected by colorimetric changes in 
the reporter cells adjacent to the library cell. 

In other embodiments, indicator cells can in turn 
produce something that modifies the growth rate of the library 
cells via a feedback mechanism. Growth rate feedback can 
detect and accumulate very small differences. For example, if 
the library and reporter cells are competing for nutrients, 
library cells producing compounds co inhibit the growth of the 
reporter cells will have more available nutrients, and thus 
will have more opportunity for growth. This is a useful 
screen for antibiotics or a library of polyketide synthesis 
gene clusters where each of the library cells is expressinc 
and exporting a different pclykecide gene product. 

Another variation cf this theme is that the 
reporter cell for an antibiotic selection can itself secrete a 
toxin or antibiotic that inhibits growth of the library cell. 
Production by the library cell cf an antibiotic that is able 
to suppress gro-^'th cf the reporter cell will thus allow 
uninhibited growth of the library cell. 

Conyersely, i: che library is being screened for 
production cf a co.-.pounc thai stir.ulaces the growth of the 
reporter cell (for oxar.cle, in ixproymg che:7,ical syntheses, 
wne iiorary cell rr^ay supply nutrients such as amino acids to 
an auxotrophic reporter, or growth factors to a growth- factor - 
dependent reporter. The reporter cell in turn should produce 
a compound that stirr.ulates the growth of the library cell. 
Interleukins, growth factors, and nutrients are possibilities. 

Further possibilities include competition based on 
ability to kill surrounding cells, pcsitiye feedback loops in 
which the desired product r^ade hy zhe evolved cell stimulates 
the. indicator cell to produce a positive growth factor for 
cell A, thus indirectly selecting for increased product 
formation. 
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In some embodiments of che invention it can be 
advantageous to use a different organism (or genetic 
background) for screening than the one that will be used in 
the final product. For example, markers can be added to DNA 
constructs used for recursive sequence recombination to make 
the microorganism dependent on the constructs during the 
improvement process, even though those markers may be 
undesirable in the final recombinant microorganism. 

Likewise, in some embodiments it is advantageous to 
use a different substrate for screening an evolved enzyme thar. 
the one that will be used in the final product. For example, 
Evnin et al . (Proc. Na tl. Acad. Sci . U.S.A. 87:6659-6663 
(1990)) selected trypsin variants with altered substrate 
specificity by requiring that variant trypsin generate ar. 
essential amino acid for an arginine auxotroph by cleavina 
arginine /?-naphthylamide . This is thus a selection for 
arginine-speciiic trypsin, wLzh :he growth race of the host 
being proportional to that cf che enzyme activity. 

The pool of jells sur^viving screening and/or 
selection is enriched for recorr±)inant genes conferring the 
desired phenot^^e (e.g. alcered substrace specificity, altered 
biosynthetic ability, ecc.}. Further enrichment can be 
cbcamed, if desired, by performing a second round of 
screening and/or selection wichouc generating additional 
diversity. 

The recorJDir.anc gene cr pool of such genes 
surviving one round cf screening/selection forms one or more 
of the substrates for a second round of recombination. Again, 
recombination can be perform.ed in vivo or in vi cro by any of 
the recursive sequence recomJomacion formats described above. 
If recursive sequence recombination is performed in vicro^ the 
recombinant gene or genes to form che substrate for 
recomiDination should be excracced from the cells in which 
screening/selection was perfcr~ed. Optionally, a subsequence 
of, such gene or genes can be excised for more targeted 
subsequent recombination. If the recombinant gene(s) are 
contained wichin episomes, their isolation presents no 



wo 97/35966 



PCTaJS97/047l5 



32 

difficulties. If the recombinant genes are chiornosomally 
integrated; they can- be isolated by amplification primed from 
known sequences flanking the regions in which recombination 
has occurred. Alternatively, whole genomic DNA can be 
5 isolated, optionally amplified, and used as the substrate for 
recombination. Small samples of genomic DNA can be amplified 
by whole genome amplification with degenerate primers {Barrett 
ec'al. Nucleic Acids Research 23:3488-3492 (1995)). These 
primers result in a large amount of random 3' ends, which can 

10 undergo homologous recombination when reintroduced into cells. 

If the second round of recombination is to be 
performed in vivo, as is often the case, it can be performed 
in the cell surviving screening/selection, or the recombinant 
genes can be transferred to another cell type (e.g., a cell 

15' type having a high frequency of mutation and/or 

recombination) . In this situation, recombination can be 
effected by introducing additional DNA segment (s) into cells 
bearing the recombinant genes. In other methods, the cells 
can be induced to exchange genetic information with each other 

20 by, for example, elec tropora t ion . In some methods, the second 
round of recombination is perfcrrr.ed by dividing a pool of 
cells surviving screening/selection in the first round into 
two subpopuia t ions . DN'A from one subpopula t ion is isolated 
and trans fee tec into the o::her population, where the 

25 recombinant genets) from, the two subpopulat ions recombine to 
form a further library of recombinant genes. In these 
methods, it is not necessary to isolate particular genes from 
uhe first subpopulat ion or to take steps to avoid random 
shearing of DNA during exeraction. Rather, the v;hole genome 

3 0 of DNA sheared or otherwise cleaved into manageable sized 

fragments is transfected into the second subpopulat ion . This 
approach is particularly useful when several genes are being 
evolved simultaneously and/or the location and identity of 
such genes within chromosome are noc known. 

35 ' The second round of recombination is sometimes 

performed exclusively among the recombinant molecules 
surviving selection. However, in other embodiments, 
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additional substrates "can be introduced. The additional 
substrates can be of the same form as the substrates used in 
the first round of recombination, i.e., additional natural or 
induced mutants of the gene or cluster of genes, forming the 
substrates for the first round. Alternatively, the additional 
substrate (s) in the second round of recombination can be 
exactly the same as the substrate (s) in the first round of 
replication. 

After the second round of recombination, 
recombinant genes conferring the desired phenotype are again 
selected. The selection process proceeds essencially as 
before. If a suicide vector bearing a selective marker was 
used in the first round of selection, the same vector can fa- 
used again. Again, a cell or pool of cells surviving 
selection is selected. If a pool of cells, the cells can be 
subject to further enrichment:, 

^ ^ ' • Recursive Sequence P!^c orri3inat ion of Genes For 
Bioremediat ion 

Modern industry a-nerates many pollutants for which 

Che environment can no longer be considered an infinite sink, 
u'aturally occurring rr.i c roorgan i srr.s are able to metabolize 
thousands of organic co-pounds, -inoluding many not found in 
nature (e.g xenobiot i os : . Biore.-.ediat ion , the deliberate use 
o: rr.icroorganisns :or the biodeg rada t ion of man-made wastes, 
IS an e-ergmg cecnnology that offers cost and practicality 
advantages over traditional methods of disposal. The success 
of bioremediation depends on the availability of organisms 
that are able to detoxify or mineralize pollutants. 
Microorganisms capable cf degrading specific pollutants can be 
generated by genetic encineerinc and recursive sequence 
recombination . 

Although bioremediation is an aspect of pollution 
control, a more useful appr-aih m the long term is one of 
prevention before industrial waste is pumped into the 
environment. Exposure cf industrial waste streams to 
recursive sequence recombmat ion -generated microorganisms 
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-capable of degrading the pollutants they contain would result 
in detoxification of mineralization of these pollutants before 
the waste stream enters the environment. Issues of releasing 
recombinant organisms can be avoided by containing them within 
bioreactors fitted to the industrial effluent pipes. This 
approach would also allow the microbial mixture used to be 
adjusted to best degrade the particular wastes being produced.. 
Finally, this method would avoid the problems of adapting to 
the outside world and dealing with competition that face many 
laboratory microorganisms. 

In the wild, microorganisms have evolved new 
catabolic activities enabling them to exploit pollutants 
as nutrient sources for which there is no competition. 
However, pollutants that are present at low 
concentrations in the environmenc may not provide a 
sufficient advantage to stirr.ulate the evolution of 
catabolic enzymes. ror a review o: such naturally 
occur rinc evolution of biccecrada t ive pathways and the 
manipulation of some of -ic rcorcnr.isms by classical 
techniques, see Ramos er al., 5io /Technolccv 12:1349-1355 
(1994 ) . 

Generation of nev; catabolic enz\^ies or pathways 
for bioremediat ion has thus relied upon deliberate 
transfer of soecific cenes between organisms (Wackett et 
al . , suora} , forced rr.a:.ing3 between bacteria wich 
specific catabolic capabilities (Brenner et al. 
Siodecradaticn 5:359-377 (1994)). or prolonged selection 
in a chemostat. Some researchers have attempted to 
facilitate evolution via naturally occurring genetic 
mechanisms in cheir chemostat selections by including 
microorganisms with a variety of catabolic pathways 
(Kellogg et. al . Science 214:1133-1135 (1981); 
Chakrabarty American Society of Micr o. Biol. News 
62:130-137 (1995)). For a review of efforts in this 
area, see Cameron et al . Aoolied Biochem. Biotech. 
38 :105-140 (1993) . 
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Current efforts in improving organisms for 
bioremediation take a labor-intensive approach in which 
many parameters are optimized independently, including 
transcription efficiency from native and heterologous 
promoters, regulatory circuits and translational 
efficiency as well as improvement of protein stability 
and activity (Timmis et al . Ann. Rev. Microbiol. 48:525- 
527 (1994) ) . 

A recursive sequence recombination approach 
overcomes a number of limitations in the bioremediation 
capabilities of naturally occurring microorganisms. 
Both enzyme activity and specificity can be altered, 
simultaneously or sequentially, by the methods of the 
invention. For example, catabolic enzymes can be evolved 
to increase the rate ac which they act on a substrate. 
Although knowledge of a rate - limiting step in a metabolic 
pathway is not required to practice the invention, rate- 
limit ing _ proteins in pathways can be evolved to have 
increased expression and/or aczivizy, the requirement for 
inducing substances can be eliminated, and enz>Tnes can be 
evolved that catalyze novel reactions. 

Some examples cf chemical targets for 
oicremediation include cue are not limited to benzene, 
>:ylene, and toluene, ca~phcr, naphthalene, halogenated 
nydroca rbons , pol ychlcr i na ted bichenvls (PCBs), 
:richlorethylene , pesticides such as pen: achloropheny 1 s 
(?C?s) , and herbicides such as atrazine. 

A . Aroma tic Hydrocarbons 

Preferably, when an enzymne is "evolved" to 
have a new catalytic function, chat: function is 
expressed, either const itutively or in response to the 
new substrate. Recursive secuence recombination subjects 
both structural and regulatory elements (includina the 
structure of regulatory proteins) of a protein to 
recombinogenic mutagenesis simultaneously. Selection of 
mutants that are efficiently able to use the new 
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substrate as a nutrient source will be sufficient to 
ensure that both the enzyme and its regulation are 
optimized, without detailed analysis of either protein 
structure or operon regulation. 

Examples of aromatic hydrocarbons include but 
are not limited to benzene, xylene, toluene, biphenyl, 
and polycyclic aromatic hydrocarbons such as pyrene and 
naphthalene. These compounds are metabolized via 
catechol intermediates. Degradation of catechol by 
Pseudoirtonas putida requires induction of the catabolic 
operon by cis, cis-muconate which acts on the CatR 
regulatory protein. The binding site for the CatR 
protein is G-N^,-A. while the optimal sequence for the 
LysR class of activators (of which CatR is a meraber) is 
T-N.,-A. Mutation of the G co a T in the CatR binding 
site enhances the expression of catechol metabolizing 
genes (Chakrabarty , American Societv of MicrobioloQv News 
62:130-137 (1996)). This derons : ra t es that the control 
of existing catabolic pachwa'/s is not optimized for the 
metabolism of specific xenobiotics. It is also an 
example of a t^/pe of mutant chat would be expected from 
recursive seauence recorJDinat ion of the operon followed 
by selection of bacteria that are better able to degrade 
the target compound. 

As an exanc'le of starting materials, 
cicxyqenases are re^q^aired for many pathways in which 
aromatic compounds are cataboli^ed. Even small 
differences in dioxygenase sequence can lead to 
significant differences in substrate specificity 
(Furukawa et al . J. 3act . 175:5224-5232 (1993); Erickson 
et al. Add. Environ. Micro. 59:3858-3862 (1993)). A 
hybrid enzyme made using sequences derived from two 
"parental" enzymes may possess catalytic activities that 
are intermediate between the parents (Erickson, ibid.), 
or may actually be better than either parent for a 
specific reaction (Furukawa et al. J- Bact . 176:2121-2123 
(1994)). In one of these cases site directed mutagenesis 



wo 97/35966 



PCrAJS97/047I5 



37 



was used to generate a single polypeptide with hybr^'d 
sequence (Erickson, ibid.); in the other, a four subunit 
enzyme was produced by expressing two subunits from each 
of two different dioxygenases (Furukawa, ibid.). Thus 
sequences from one or more genes encoding dioxygenases' 
can be used in the recursive sequence recombination 
techniques of the instant invention, to generate enzymes 
with new specificities. m addition, other features of 
Che catabolic pathway can also be evolved using thes^ 
techniques, simultaneously or sequentially, to ootimize 
cne metabolic pathway for an activity of interest. 

^- Haloaenat-ed Hydrnrar-h^no 

Large quantities of halogenated hydroca-bons 
are produced an.nually for uses as solvents and biocid^s 
These i.Mclude, .n the United States alone, over 5 mU-:o. 
tens of both l,2-d.chloroecha..e a.nd vmyl chloride used 
PVC production in the U.S. alone. The co.T.oounds a^^ 
largely net biodegradable by processes in single 
organisms, although in principle haloarcmatic catabolic 
.i^athways can be ccnscrucced by cor^bininc .genes from 

^^^ci._s..s. ^..^v^os can oe manioulated to 

^i^--.ies. Recursive seouer.ce 

reconomation offers the possibility of tailoring enzv^o 

soeciiic'tv^ono-'o---^^-,^ ' . 

ic.c/ .o .... rates wirr.out needing detailed 

structural analysis of the enzv-.es . 

As an example of possible starting materials 
:cr tne methods of the instant invention, Wackett et al 
(Nature 368:627-629 (195,,)) recently demonstrated that ' 
tnrough classical techniques a recom^^inant Pseudomonas 
strain m which seven ge.nes encoding two multi -comoone-^t 
oxygenases are combined, generated a single host that can 
-.etaoolize polyhalogenated compounds by secuential 
reductive and oxidative tecnniq.es to yield non-to-ic 
proGucts. These and/or related materials can b« 
subjected to the techniques discussed above so as to 
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- evolve and optimize a biodegradat i ve pathway in a single 
organism. 

Trichloroethylene is a significant groundwater 
contaminant. It is degraded by microorganisms in a 
5 cometabolic way (i.e., no energy or nutrients are 

derived) . The enzyme must be induced by a different 
compound (e.g., Pseudomonas cepacia uses 
toluene-4 -monoxygenase, which requires induction by 
toluene, to destroy trichloroethylene) . Furthermore, the 

10 degradation pathway involves formation of highly reactive 
epoxides that can inactivate the enzyme (Timmis et al. 
Ann. Rev. Microbiol. 48:525-557 (1994)). The recursive 
sequence recombination techniques of the inve.ntion could 
be used to mutate the enzv^.e ar.d its regulatory region 

15 such that it is produced ccr.stiiut ively . and is less 

susceptible to epoxide inacciva: ion , In some embodiments 
of the invention, selection c: hosts const i tut ively 
producing the enzyme and lesz susceptible to the epoxides 
can be accomplished by de^.animg growth in the presence 

20 of increasing concentrations of trichloroethylene in the 

absence of inducing substances. 

C . Polvchlor inated Biohenvls (PCBs) and 

Polvcvclic Aromatic Hydrocarbons (PAHs) 

25 PCBs and P.-J^s are families of structurally 

related corr.pounds thai are rr.ajcr pollutants at many 
Superfund sites. Bacteria transformed with pi asm ids 
encoding enzyTTies with broader substrate specificity have 
been used commercially. In nature, no known pathways have 

3 0 been generated in a single host that degrade the larger 

PAKs or more heavily chlorinated PCBs. Indeed/ often the 
collaboration of anaerobic ar.d aerobic bacteria are 
required for complete me t arc 1 i sr. . 

Thus, likely scurcos for starting material for 

35 recursive sequence reccmbina*: i cn include identified genes 

encoding PAH-degrading catanoiic pathways on large (20- 
100KB) plasmids (Sanseverino et al. Applied Environ. 
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f^i<^rQ- 59:1931-1937 (1993); Simon ec al. Gene 127:31-37 

(1993) ; Zylstra et al. Annals of the NY Acad. Sci . 
721:386-398 (1994)); while biphenyl and PCB-metabolizing 
enzymes are encoded by chromosomal gene clusters, and in 
a number of cases have been cloned onto plasmids (Hayase 

^1- J. Bacteriol. 172:1160-1164 (1990); Furukawa et 
al. Gene 98:21-28 (1992); Hofer et al. Gene 144:9-16 

(1994) ). The materials can be subjected to the 
techniques discussed above so. as to evolve a 
biodegradative pathway in a single organism. 

Substrate specificity in the PCfi pachway 
largely results from enzymes involved in initial 
dioxygenation reactions, and can be significantly altered 
by mutations in those enz>T*.es (Erickson ec al. Applied 
Environ. Micro. 59:3858-38662 (1993); Furukawa ec al . 
Bact_^ 175:5224-5232 (1993). Mineralization of PAHs and 
?C3s requires that the downstream, pathway is able to 
metabolize the products cf the initial reaction (Brenner 

3iQdeGrad3;ion 5:359-377 (1994)). In this case, 
recursive sequence recon-jDina : ion of the entire pathway 
with selection for bacteria able to use the ?CB -or PAH as 
the sole carbon source will allow production of novel PC3 
Br,z PAH deoradinc bacteria. 

*- ^ D . Herbic id^s 

A general rr.ethod for evolving genes for the 
catabolisn of insoluble herbicides is exerr.pl i f ied as 
follows for atrazine. Atrazine [2 -chloro-4 - (e thylamino) - 
6- (isopropylarr.ino) -1, 3, S-triazme] is a moderately 

30 persistent herbicide which is frequently detected in 

ground and surface water at concentrations exceeding the 
3 ppb health advisory level set by the EPA. Atrazine can 
be slowly metabolized by a Pseudo.Tsonas species 
(Mandelbaum et al . Aool . Environ . Xicro . 61:14 51-1/^57 

35 (1995))^ -p^g enz\anes catalyzing the first two steps in 

atrazine metabolism by PseudoT.onas are encoded by genes 
A.tzA and Atz3 (de Souza et al. Anol . Environ. Micro. 



20 
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- 61:3373-3378 (1995)). These genes have been cloned in a 
6.8 )cb fragmenc into pUClS (AtzAB-pUC) . E. coli carrying 
this plasmid converts atrazine to much more soluble 
metabolites. It is thus possible to screen for enzyme 
5 activity by growing bacteria on plates containing 

atrazine. The herbicide forms an opaque precipitate in 
the plates, but cells containing AtzAB-pU18 secrete 
atrazine degrading enzymes, leading to a clear halo 
around those cells or colonies. Typically, the size of 

10 the halo and the rate of its formation can be used to 
assess the level of activity so that picking colonies 
with the largest halos allows selection of the more 
active or highly produced atrazine degrading enzymes. 
Thus, the plasmids carrying these genes can be subjected 

15 CO the recursive sequence recorrjDina t ion formats described 

above to optimize the catabolis" of atrazine in E. coli 
or another host of choice, including Pseudomonas, After 
each round of recorrLbinacior. , screening of host colonies 
expressing the evolved genes car. be done on agar plates 

20 containing atrazine to observe halo formation. This is a 
generally applicable mechod for screening enzymes that 
"lecabolize insoluble co-pounds to those that are soluble 
(e.g.', polycyclicaromat ic hydrocarbons). Additionally, 
catabolism of atrazine can pro\'ide a source of nitrogen 

25 for the cell; if no ether nitrogen is available, cell 

crcv.'th v;ill be lir.itec by the rate at which the cells can 
catabolize nitrogen. Cells able to utilize atrazine as a 
nitrogen source can thus be selected frorr. a background of 
n on- utilizers or poor-utilizers. 

30 

E . Heavy Metal Detoxification 
Bacteria are used corr.me rc ia 1 ly to detoxify 
arsenate waste generated by th:e rr.ining of arsenopyrite 
cold ores. As v/ell as rr.ining effluent, industrial waste 
35 water is often contamina _ec v.ith heavy metals (e.g., 

those used in the manufacture of electronic components 
and plastics). Thus, simply to be able to perform other 
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bioremedial functionsT microorganisms must be resistant 
to the levels of heavy metals present, including 
mercury, arsenate, chromate, cadmium, silver, etc. 

A strong selrctive pressure is the ability to 
metabolize a toxic compound to one less toxic. Heavy 
metals are toxic largely by virtue of their ability co 
denature proteins (Ford ec al . Bioextrarr i nn ;,nH 
■ Biodeterioration of Merals , p. 1-23) . Detoxification of 
heavy metal contamination can be effected in a number of 
ways including changing the solubility or 
bioavailability of the metal, changing its redox state 
(e.g. toxic mercuric chloride is detoxified by reduction 
to the much more volatile elemental mercury) and even by 
bioaccumulation of the metal by immobilized bacteria or" 
plants. The accumulation of netals to a sufficiently 
high concentration allo-.-s mecal co be recycled; smelting 
burns off the organic par- of zr.e organism, leaving 
behi.'.d reusable accumulated r.etal. Resistances to a 
num±>er of heavy metals (arsenate, cadmium, cobalt, 
chromium, copper, mercury, .-.-ckei, lead, silver, and 
::inc) are plasmid e.-^coded in a r.urr.ber of species 
including Staphylococcus and Pseucomonas (Silver et al. 
S nviron. Health porsn^- :;^ 102:107-113 (1S94); Ji et al . 

1-5:61 -75 ( 1995)). These genes also confer 
heavy r,etal resistance zr. other species as well (e.g., E. 

s..-. se__ence reco-jDinat icn techniques 

of the instant invention (RSR) can be used to increase 
microbial heavy metal tolera.nces, as well as to increase 
-he e:<tent to which cells will atcuxulate heavy metals. 
For example, the ability of coli co detoxify arsenate 
can be improved at least 100- fold by RSR (see co-pendinc 
Application Serial .n;o . 03/^21, 55.^, filed .Varch 25, 1996) . 

Cyanide is very efficiently used to extract 
gold from rock containing as l.^zle as 0.2 oz per ton. 
This cyanide can be m.icrcbiaiiy neutralized and used as a 
nitrogen source by fungi or bacteria such as Pseudomonas 
fluorescens. "A probierr, with microbial cyanide 
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degradation is the presence of toxic heavy metals in the 
leachate. RSR can be used to increase the resistance of 
bioremedial microorganisms to toxic heavy metals, so that 
they will be able to survive the levels present in many 
industrial and Superfund sites. This will allow them to 
biodegrade organic pollutants including but not limited 
to aromatic hydrocarbons, halogenated hydrocarbons, and 
biocides . 

F . Microbial Mining 

"Bioleaching" is the process by which microbes 
convert insoluble metal deposits (usually metal sulfides 
or oxides) into soluble metal sulfates, Bioleaching is 
commercially important in the mining of arsenopyr i te , but 
has additional potential in the detoxification and 
recovery of metals and acids from waste dumps. Naturally* 
occurring bacteria capable of bioleaching are reviewed by 
Rawlings and Silver ( 3 io/TechnoIccv 13:773-778 (1995)). 
These bacteria are t\T>ically divided into groups by their 
preferred temoeratures for growth. The more important 
mesophiies are Thicbacillus and Leptospi ri 1 Ivjt] species. 
Moderate thermophiles include Sal fobacillus species. 
Extreme thermophiles include Si:lfolobus species. Many of 
these organisrr.s are difficult to grow in commercial 
industrial settings, making their cataboiic abilities 
attractive candidates for transfer to and optimization in 
other organisms such as Pseudor.onas , Rhodococcus , T, 
ferrooxidans or E. coii . Genetic systems are available 
for at least one strain of T. ferrooxidans , allowing the 
m.anipulat ion of its genetic m.aterial on plasmids. 

The recursive sequence recombination m.ethods 
described above can be used to optimize the catalytic 
abilities in native hosts or heterologous hosts for 
evolved bioleaching genes or pathways, such as the 
a*bility to convert metals from insoluble to soluble 
salts. In addition, leach rates of particular ores can 
be improved as a result of, for example, increased 
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resistance to toxic Compounds in chs ore ccicentrate, 
increased specificity for certain substrates, ability to 
use different substrates as nutrient sources, and so on. 

G . Oil Desulf urizar-i nn 

The presence of sulfur in fossil fuels has been 
correlated with corrosion of pipelines, pumping, and 
, refining equipment, and with the premature breakdown of 
combustion engines. Sulfur also poisons many catalysts 
used in the refining of foss-ii fuels. The atmospheric 
emission of sulfur combustion products is known as acid 
rain . 

Microbial desulf urizacion is an appealing 
bioremediacion applicatio.n . Several bacteria have been 
reported chat are capable of cacabolizing 
dibenzothiophene (D3T) , which is the representative 
corr.pound of the class of s-f^^r compounds found in foss- 
fuels. U.S. Patent No. 5. 501 discloses the cloninc 
of a DN-A xoiecule fro:., .-c.rcccc.s rhodochrous cacable'of 
oiocatalyzir.g the desulf'^riration of oil. Denome et al 
(Gene 175:6890-6901 (: = 95i; disclose the cloning of^ 
9.8 kb D^:a fragment fr- Psendo-onas encoding the uoper 
naphthalene catabolizi.ng pathway which also degrades" 
cibenzothiophe.-e. Ctr.er ge.nes have been identified that 

T.-.e activity cf these er.zyr.es is currently too 
iow to be comr.erciaiiy viable, but the pathway could b- 
increased in efficiency usi.-.g the recursive seouence 
recombination techniques of the invention. The desired 
property of the genes of interest is their ability to 
desulfurize dibenzothicphene or its alkyl or arvl' 
substituted analogues. Ir. scn-.e er^Dodiments of the 
invention, selection is preferaoly accomplished by 
coupling this pathway to o.-.. providi.ng a nutrient to the 
bacteria. Thus, for exarr-.pic-, desulf urization of 
dibenzothicphene results in formation of hydroxybiohenyl . 
This is a substrate for the biphenyl -catabolizing oathway 
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which provides carbon and energy. Selection would thus 
be done by "shuffling" the dibenzothiophene genes and 
transforming them into a host containing the biphenyl- 
catabolizing pathway. Increased dibenzothiophene 
5 desulf urizat ion will result in increased nutrient 

availability and increased growth rate. Once the genes 
have been evolved they are easily separated from the 
biphenyl degrading genes. The latter are undesirable in 
the final product since the object is to desulfurize 

10 without decreasing the energy content of the oil. Alkyl 
or aryl substituted dibenzothiophenes can be detected by 
changes in fluorescence (Krawiec, S., Devel . Indus . 
Microbioloav 31:103-114 (1990)) or by detection of phenol 
groups formed as a result of desul f urizat ion (Dacre, J.C. 

IS' A.-al ■ Chen. 43:589-591 (1971)). 

H . Grqano-Nitro Con-.oounds 

Organo- nitre coT.pour.ds are used as explosives, 
dyes, drugs, polyrr.ers ar.d anc i~icrobi a 1 agents. 

20 Biodegrada t ion of these cc-pz'^r.ds occurs usually by way 

of reduction of the r. itra::e croup, catalyzed by 
nitroreductases, a family of broadly -specific enzymes. 
Partial reduction of" orcano-nitro corr.pounds often results 
in the formation of a cor.pound more toxic than the 

7:5 original (Hassan er al. 1979 Arch Bioch 5ioo. 196 : 3 3 5 - 

3 95) . Recursive secuer.ce recomJoina t ion of 
nitroreductases can produce enzvTries that are more 
specific, and able to r.ore completely reduce (and thus 
detoxify) their target compounds (examples of which 

30 include but are not limited to nit rotoluenes and 

ni t robenzenes ) . Nitro- reductases can be isolated from 
bacteria isolated from explosive -contaminated soils, such 
as /'j'organeila morganii and ^ncerobacter cloacae (Bryant 
et. al., 1991. J. Biol Chem . 255:4126-4130) . A preferred 

35 . selection m.ethod is to look for increased resistance to 
the organo-nitro comioound of interest, since that will 
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indicate that the enzyme is also able to reduce any toxic 
partial reduction products of the original compound. 

Use of Alternative Substrates for Chemical Synthf^.g^c; 
Metabolic engineering can be used to alter 
microorganisms that produce industrially useful 
chemicals, so that they will grow using alternate and 
more abundant sources of nutrients, including human- 
produced industrial wastes. This typically involves 
providing both a transport system to get the alternative 
substrate into the engineered cells and catabolic enzymes 
from the natural host organisms to the engineered cells. 
In some instances, enzymes can- be secreted into the 
medium by engineered cells to degrade the alternate 
substrate into a form that can more readily be taken u? 
by the engineered cells; i.n other instances, a batch of 
engineered cells can be crown cn one preferred substrate, 
Chen lysec to liberate hydrolytic enzyrr.es for the 
alternate substrate into the medium, while a second 
inoculum of the same engineered host or a second host is 
added to utilize the hvdrcivzate. 

Tne starting materials for recursive sequence 
recomomation will tvpically be genes for utilization of 
a suostrate or -ts transport. Examples of nutrient 

sources of ^ ^--^c^- ^..^ _^ 

-Q- iimitec to 

lactose, whey, galactose, mannitol, xylan, cellobiose, 
cellulose and sucrose, thus allowing cheaper production 
of compounds including but not limited to ethanol, 
tryptophan, rhamnolipid surfactants, xanthan gum, and 
polyhydroxylalkanoate. For a review of such substrates 
as desired target substances, see Cameron et al. ( Aool , 
B iochem . Bioterhnnl 33:105-l'iO (1S93)). 

The recursive sequence recombination methods 
described above can be used to optimize the ability of 
n.ative hosts or heterologous hosts to utilize a substrate 
of interest, to evolve more efficient transport systems. 
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to increase or alter specificity for certain substrates, 
and so on. 

V . Biosynthesis 

Metabolic engineering can be used to alter 
organisms to optimize the production of practically any 
metabolic intermediate, including antibiotics, vitamins, 
amino acids such as phenylalanine and aromatic amino 
acids, ethanol, butanol, polymers such as xahthan gum and 
bacterial cellulose, peptides, and lipids. When such 
compounds are already produced by a host, the recursive 
sequence recombination techniques described above can be 
used to optimize production of the desired metabolic 
intermediate, including such features as increasing 
enzyme substrate specificity and turnover number, 
altering metabolic fluxes to reduce the concentrations of 
toxic substrates or intermediates, increasing resistance 
of the host to such toxic ccr.pounds, eliminating, 
reducing or altering the need for inducers of gene 
expression/activity, increasing the production of enzym.es 
necessary for netabol isrr. , etc. 

Enzymes can also be evolved for improved 
activitv in solvents other than water. This is useful 
because interr-.ediates in cher.ical syntheses are often 
orotected by blocking crcups which dramatically affect 
the solubility of the ccr.pound in aqueous solvents. Many 
coxDounds can be p reduced by a combination of pure 
chemical and enzyrr.ical ly catalyzed reactions. Performing 
enzymic reactions on almost insoluble substrates is 
clearly very inefficient, so the availability of enzymes 
that are active in other solvents will be of great use. 
One example of such a sche-e is the evolution of a para- 
nitrobenzyl esterase to rer.ove protecting groups from an 
intermediate in loracarbef synthesis (Moore, J.C. and 
Arnold, r.H. Nature 3 iot echnoioav 14:458-467 (1996)). 
In this case alternating rounds of error-prone PGR and 
colony screening for production of a fluorescent reporter 
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from a substrate analogue were used to generate a mutant 
esterase that was 16-fold more active than the parent 
molecule in 30% dimethylf ormamide . No individual 
mutation was found to contribute more than a 2-fold 
increase in activity, but it was the combination of a 
number of mutations which led to the overall increase. 
Structural analysis of the mutant protein showed that the 
amino acid changes were distributed throughout the length 
of the protein in a manner that could not have been 
rationally predicted. Sequential rounds of error-prone 
PCR have the problem that after each round all but one 
mutant is discarded, with a concomitant loss of 
information contained in all the other beneficial 
mutations. Recursive sequence recombination avoids this 
problem, and would thus be ideally suited to evolving 
enzymes for catalysis in other solvents, as well as in 
conditions where salt cor.cer.t rat ions or pH were different 
from the original enzNcr.e cc::i"as. 

In addiCic, the yield of almost any metabolic 
pathway can be increased, whether consisting entirely of 
genes endogenous to the hose organisms or all or partly 
heterologous genes. Opci-ization of the expression 
levels of the enzyrr.es in a pathway is more complex than 
simply maximizing expression. In some cases regulation, 
rather than constitutive expression of an enzyme may be 
advantageous for cell growth and therefore for product 
yield, as seen for production of phenylalanine (Backman 

Ann. NT Acad. Sci . 589:16-24 (1990)) and 2-keto-L- 
ciuconic acid (Anderson et al . U.S. 5,032,514). In 
addition, it is often advantageous for industrial 
purposes to express proteins m organisms other than 
their original hosts, riew host strains may be preferable 
for a variety of reasons, including ease of cloning and 
transformation, pathogenicity, ability to survive in 
particular environments and a knowledge of the physiology 
and genetics of the organisr.s. However, proteins 
expressed in heterologous organisms often show markedly 
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reduced activity for a variety of reasons including 
inability to fold properly in the new host (Sarchy et al. 
ApdI. E nviron. Micro. 53:1996-2000 (1987)). Such 
difficulties can indeed be overcome by the recursive 
sequence recombination strategies of the instant 
invention . 

A . Antibiotics 

The range of natural small molecule antibiotics 
includes but is not limited to' peptides, pept idolactones , 
thiopep tides , beta -lactams, glycopeptides , 1 antibiotics , 
microcins, polyke t ide -derived antibiotics (anthracycl ins , 
te t racyclins , macrolides, avermectins, polyethers and 
ansamycins) , chloramphenicol, aminoglycosides, 
aminocycli tols , polyoxins, agrocins and isoprenoids. 

There are at least three ways in which 
recursive sequence recorrJbir.ar ion techniques of the 
instant invention can be use:: to facilitate novel drug 
synthesis, or to improve bicsynthesis of existing 
antibiotics . 

First, antibiotic synthesis enzymes can be 
"evolved" together with transport systems that allow 
entry of compounds used as antibiotic precursors to 
improve uptake and incorporation of funct ion - alter ing 
artificial sice chain precursors. For exam.ple, 
penicillin V is produce c by feeding Panic i 11 iusu the 
artificial side chain precursor ph e noxyacet ic acid, and 
LY14 6032 by feeding S treptorrivces roseosporus decanoic 
acid (Hopwood, Phil. Trans. ?. . Soc . Lond . 3 324:549-562 
(1939)) . Poor precursor uptake and poor incorporation by 
the synthesizing enzyme often lead to inefficient 
formation of the desired product. Recursive sequence 
recombination of these two systems can increase the yield 
of desired product. 

Furthermore, a combinatorial approach can be 
taken in which an enzyme is shut fled. for novel catalytic 
activity/substrate recognition (perhaps by including 
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randomizing oligonucleotides in key positions such as the 
active site) . A nui-nber of different substrates (for 
example, analogues of side chains that are normally 
incorporated into the antibiotic) can then be tested in 
combination with all the different enzymes and tested for 
biological activity. In this embodiment, plates are made 
containing different potential antibiotic precursors 
(such as the side chain analogues) . The microorganisms 
containing the shuffled library (the library strain) are 
replicated onto those plates, together with a competing, 
antibiotic sensitive, microorganism (the indicator 
strain) . Library cells that are able to incorporate the 
new side chain to produce an effective antibiotic will 
thus be able to compete with the indicator strain, and 
will be selected for. 

Second, the expression of heterologous genes 
transferred from one antibiocic synthesizing organism to 
another can be optimized. The newly introduced enzyme (s) 
act on secondary metabolites in the host cell, 
transforming them into new co-pounds with novel 
properties. Using traciticnal methods, introduction of 
foreign genes into antibiotic synthesizing hosts has 
already resulted in the product.icn of novel hybrid 
antibiotics. Examples include mederrhcdin, 
cmycrogranat irhodin , 5 - ceoxyery thromycin A, 
^sovalerylspiramycm and other hybrid macrolides (Cameron 

Aool. Biochem. Biotechnoj. . 38:105-140 (1993)). 
The recursive sequence recombination techniques of the 
instant invention can be used to optimize expression of 
the foreign genes, to stabilize the enz^/me in the new 
host cell, and to increase the activity of the introduced 
enzyme against its new substrates in the new host cell. 
In some embodiments of the invention, the host genome may 
also be so optim.ized. 

Third, the substrate specificity of an enzyme 
involved in secondary metabolism can be altered so that 
It will act on and modify a new compound or so that its 
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activity is changed and it acts at a different subset of 
positions of its normal substrate. Recursive sequence 
recombination can be used to alter the substrate 
specificities of enzymes. Furthermore, in addition to 
recursive sequence recombination of individual enzymes 
being a strategy to generate novel antibiotics, recursive 
sequence recombination of entire pathways, by altering 
enzyme ratios, will alter metabolite fluxes and may 
result, not only in increased antibiotic synthesis, but 
also in the synthesis of different antibiotics. This can 
be deduced from the observation that expression of 
different genes from the same cluster in a foreign host 
leads to different products being formed (see p. 80 in 
Hutchinson ec. al . , (1991) Ar.n >^r{ Acad Sci , 646:78-93). 
Recursive sequence recombinatiion of the introduced gene 
clusters may result in a variety of expression levels of 
different proteins within che cluster (because it 
produces different combinat icr.s of, in this case 
regulatory, mutations) . This in turn may lead to a 
variety of different end products. Thus, "evolution" of 
an existing antibiotic 3yr.thesi?-ing pathway could be used 
to generate novel antibiotics either by modifying the 
rates or substrate specificities of enzyrrtes in that 
pat n '**.' a y . 

Additionally, ant ibic tics can also be produced 
: n vi c r o by t he action c f a purified e n z \Tr;e on a 
precursor. For example isopeni c i 1 1 in ^J synthase 
catalyses the cyclization of rr.any analogues of its normal 
substrate (d- ( L- a - aminoadipy 1 ) - L- cysteinyl - D- valine ) 
(Hutchinson, Med. Res. ?.ev . 8:557-567 (1988)). Many of 
these products are active as antibiotics. A wide variety 
of substrate analogues can be tested for incorporation by 
secondary metabolite s-znthesizinc enzyrrves without concern 
for the initial efficiency of the reaction. Recursive 
sequence recombination can be used subsequently to 
increase the rate of reaction v;ith a promising new 
substrate , 
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Thus, organisms already producing a desired 
antibiotic can be evolved with the recursive sequence 
recombination techniques described above to maximize 
production of that antibiotic. Additionally, new 
antibiotics can be evolved by manipulation of genetic 
material from the host by the recursive sequence 
recombination techniques described above. Genes for 
antibiotic production can be transferred to a preferred 
host after cycles of recursive sequence recombination or 
can be evolved in the preferred hose as described above. 
Antibiotic genes are generally clustered and are often 
positively regulated, making them especially attractive 
candidates for the recursive sequence recombination 
techniques of the instant inve.-.tion. Additionally, some 
genes of related pathways show cross-hybridization, 
making them preferred candidates for the generation of 
new pathways for new artibic-_ics by the' recursive 
sequence, recomhinacion techr.iqjes of the invention, 
rurthermore, increases in sscondary metabolite production, 
including enhancement of sucs-ate fluxes (by increasing.' 
the rate of a rate ii~ici.-.g enz-.-^^e, deregulation of the 
pathway by suppression of negative control elements or 
over expression of activatcrs and the relief of feedback 
controls by mutation of the regulated enzyme to a 
feedback-insensitive deregulared protein) can be achieved 
by- recursive sequence recorr.bi r.a c ion without exhaustive 
analysis of the regulatory rechanisms governing 
expression of the relevant gene clusters. 

The host chosen for expression of evolved genes 
is preferably resistant to the antibiotic produced, 
although in some instances production methods can be 
designed so as to sacrifice hcs: cells when the amount of 
antibiotic produced is co--ercially significant yet 
lethal to the host. Similarly, bioreactors can be 
designed so that the growth r.edium is continually 
replenished, thereby "drawir^c off antibiotic produced 
and sparing the lives of the producing cells. 
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Preferably, the mechanism of resistance is not the 
degradation of the antibiotic produced. 

Numerous screening methods for increased 
antibiotic expression are known in the art, as discussed 
above, including screening for organisms that are more 
resistant to the antibiotic that they produce. This may 
result from linkage between expression of the antibiotic 
synthesis and antibiotic resistance genes (Chater, 
Bio/Technology 8:115-121 (1990)). Another screening 
method is to fuse a reporter gene (e.g. xylE from the 
Pseudomonas TOL plasmid) co the antibiotic production 
genes. Antibiotic synthesis gene expression can then be 
measured by looking for expression of the reporter (e.g. 
xylE encodes a catechol dioxygenase which produces yellow 
muconic semialdehyde when colonies are sprayed with 
catechol (Zukowski et al. ?roc. Natl. Acad. Sci. U.S.A. 
80:1101-1105 (1983) ) . 

The wide variety of cloned antibiotic genes 
provides a wealth of sLarcinc rr:aierials for the recursive 
sequence recombina:: ion techniques of the instant 
invention. For exaxple, genes have been cloned from 
Sirepcon^yces cattleya which direct cephamycin C synthesis 
in the non - ant ibiot ic producer Screpcomyces lividans 
(Chen et al. 3 io/Technol ocv 6:1222-1224 (1988)). 
Clustered ce.nes for penicillin biosynthesis 
(6- ( L-a-aminoadipyl ) - L- cysteinyl - D- valine synthetase; 
isopenici 1 1 in N syntihetase and acyl coenzyrr.e 
A: 6 -aminopenicillanic acid acylcrans f erase ) have been 
cloned froir^ Penicilliusn chrysogenusn , Transfer of these 
genes into Neurospora crassa and Aspergillus niger result 
in the synthesis of active penicillin V (Smith et al. 
Bio/Technoloov 8:39-41 (1530)) . For a review of cloned 
genes involved in Cephalosporin C, Penicillins G and V 
and Cephamycin C biosynthes is , see Piepersberg, Crit . 
Rev. Biotechnol. 14:251-285 (1934). For a review of 
cloned clusters of ant ibiot ic - produc ing genes, see Chater 
3io/TechnoloGv 8:115-121 (1990). Other examples of 
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antibiotic synthesis genes transferred to industrial 
producing strains, or over expression of genes, include 
tylosin, cephamycin cephalosporin C, LL-E33288 complex 
(an antitumor and antibacterial agent), doxorubicin, 
5 spiramycin and other macrolide antibiotics, reviewed in 
Cameron et al. ApdI . Biochem. Biotechnol . 38:105-140 
(1993) . 

Biosynthesis to Replace Chemica:^ Svnthp.gi.Q 
of Antibiotics 
Some antibiotics are cu'rrently made by chemical 
modifications of biologically produced startino 
compounds. Complete biosynthesis of the desired 
molecules may currently be impractical because of the 
15 lack of an enzyme with the required enzym.atic activity 

and substrate specificity. For example, 
7-aminodeacecooxycephalospcrar.ic acid {7-ADCA) is a 
precursor for semi -synthe t --I ly produced cephalosporins. 
7-ADCA is made by a chemical r:r.g expansion from 
20 penicillin V followed by enzv-acic deacylation of the 

phenoxyacetal croup. Cephalosporins could zn principle 
be produced biolccicaliy froT. their corresponding 
penicillins (e.g., cephalosporin V or from penicillin V 
or G) using penicilli.- expandase, but other penicillins 
>sucn as psniciliir. V cr G) are not used as S'jbstrates by 
••t.-.ow.n expandases. The recursive sequence recombination 
techniques of the invention can be used to alter the 
enzyme so that it will use penicillin V as a substrate. 
Similarly, penicillin transacyiase could be so modified 
to accept cephalosporins or cepha-.ycins as substrates. 

In yet another exar.pie, penicillin amidase 
expressed in E. coli is a key enzyr.e in the production of 
penicillin G derivatives. T.-.c enz'./me is generated from a 
precursor peptide and tends ; accumulate as insoluble 
35 . aggregates in the periplasr, u;-.less non-metabolizable 

sugars are present in the rr.ediuT. (Scherrer et al . aoo1_, 
Microbiol. BiotRchnol . 42:S5-gl (1994)). Evolution of 
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this enzyme through tne methods of the instant invention 
could be used to generate an enzyme that folds better, 
leading to a higher level of active enzyme expression. 

In yet another example, Penicillin G acylase 
covalently linked to agarose is used in the synthesis of 
penicillin G derivatives. The enzyme can be stabilized 
for increased activity, longevity and/or thermal 
stability by chemical modification -( Fernandez-Lafuente 
et. al. Enzyme Microb. Technol . 14:489-495 (1992). 
Increased thermal stability is an especially attractive 
application of the recursive sequence recombination 
techniques of Che instant invention, which can obviate 
the need for the chemical modification of such enzymes. 
Selection for thermostability can be performed in vivo in 
£. coll or in chermophiles at higher temperatures. In 
general, thermostability is a good first step in 
e.nhancing general stabilization of enzymes. Random 
m.utagenesis and selection can also be used to adapt 
enzymes to function in non-aquecus solvents (Arnold Curr 
Ooin Biotechnol . 4:450-455 (1593); Chen et . al. Proc . 
Natl . Acad. Sci . U.S.A. . 90:5515-5622 (1593)). Recursive 
sequence recombination represents a more powerful (since 
r ecombinogen ic ) method of generating m.utant enzymes that 
are stable and active in ncn- aqueous environments. 
Additional screening car. be done on the basis of enzv^.e 
stability in s o 1 \' e n t s . 

C . Polvket ides 

Polyketides include antibiotics such as 
tetracycline ana erythromycin, anti-cancer agents such as 
daunomycin, imtmunosuppressant s such as F:o06 and 
rapamycin and veterinary products such as monesin and 
avermectin. Polyketide synthases iPKS' s) are 
multifunctional enzymes that control the chain length, 
choice of chain-building units and reductive cycle that 
generates the huge variation m naturally occurring 
polyketides. Polyketides are built up by sequential 



5 6 6 A ; 



wo 97/35966 



PCTA;S97/0-<7I5 



55 

transfers of "extender units" (fatty acyl CoA groups) 
onto the appropriate starter unit (examples are acetate, 
coumarate, propionate and malonamide) . The PKS's 
determine the number of condensation reactions and the 
type of extender groups added and may also fold and 
cyclize the polyketide precursor. PKS's reduce specific 
p-keto groups and may dehydrate the resultant /3-hydroxyls 
to form double bonds. Modifications of the nature or 
number of building blocks used, positions at which p-keto 
groups are reduced, the extent of reduction and different 
positions of possible cyclizations , result in formation 
of different final products. Polyketide research is 
currently focused on modification and inhibitor studies, 
site directed mutagenesis and 3-D structure elucidation 
to lay the groundwork for rational changes in enzymes 
that will lead to new polyketide products. 

.Recently, McDaniel et al . ( Science 262:1546- 
1550 (1995)) have developed a S :repcoruyces host-vector 
system for efficient constr— ticn and expression of 
recombinant PKSs . Hutchinson ( 3 io /Technol ncr^^ 12:375-308 
(1994)1 reviewed targeted rr.^jtation of specific 
biosynthetic genes and sijcgested that microbial isolates 
can be screened by D:,-.-\ hybridization for genes associated 
with known pharmacoicrically active agents so as to 
provide new -etabtlites or increased yields of 

a procucec. In particular, that 

review focuses on polyketide synthase and pathways to 
aminoglycoside and oligopeptide antibiotics. 

The recursive sequence recoTJDinat ion techniques 
of the instant invention can be used to generate modified 
enzyrr.es that produce novel polyketides without such 
detailed analytical effort. The availability of the ?KS 
genes on plasmids and the existence of £, coli- 
Screptomyces shuttle vectors (Weh:neier Gene 165: 149-150 
(1995) ) makes the process of recursive sequence 
recon^bination especially attractive by the techniques 
described above. Techniques for selection of antibiotic 
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producing organisms can be used as described above; 
additionally, in some embodiments screening for a 
particular desired polyketide activity or compound is 
preferable . 

D . Isoprenoida 

Isoprenoids result from cyclization of farnesyl 
pyrophosphate by sesquiterpene synthases. The diversity 
of isoprenoids is generated not by the backbone, but by 
control of cyclization. Cloned examples of isoprenoid 
synthesis genes include trichodiene synthase from 
Fusarium sprorocrichioides , pentalene synthase from 
Screpcoruyces , aristolochene synthase from Penicillium 
roquefortii , and epi -aristolochene synthase from N. 
tiabaciLT^ (Cane, D.E. (1995). Isoprenoid antibiotics, 
pages 633-655, in "Genetics and Biochemistry of 
Antibiotic Production" edited by Vining, L.C. Sc Stuttard, 
C, published by Butterworth - Heinemann) . Recursive 
sequence recoT±)inat ion o: sesquiterpene synthases will be 
of use both in allowing expression ot these enzyrr.es in 
heterologous hosts (such as plants and industrial 
rr.icrcbial strains) and in alteration of enzymes to 
change the cyclized product -ade-. A large number of 
isoprenoids are active as antiviral, antibacterial, 
antifuncal, herbicical, insecticidal or cytostatic 
agents. Antibacterial and antifungal isoprenoids could 
thus be oreferablv screened for using the indicator cell 
twe system described above, wirh the producing cell 
competing v;ith bacteria cr fungi ror nutrients. 
.Antiviral isoprenoids could be screened for preferably by 
their ability to confer resistance to viral attack on the 
producing cell. 

E . Rioactive Peptide Derivatives 
Examples of bioactive non - r ibosomal ly 
synthesized peptides include the antibiotics cyclosporin, 
peps tat in, actinomycin, grarriic idin , depsipept ides , 
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vancomycin, etc. These peptide derivatives are 
synthesized by complex enzymes rather than ribosomes. 
Again, increasing the yield of such non-ribosomally 
synthesized peptide antibiotics has. thus far been done by 
genetic identification of biosynthetic "bottlenecks" and 
over expression of specific enzymes (See, for example, p. 
133-135 in "Genetics and Biochemistry of Antibiotic 
Production" edited by Vining, L.C. & Stuttard, C. , 
published by But terworth-Heinemann) . Recursive sequence 
recombination of the enzyme clusters can be used to 
improve the yields of existing bioactive non-ribosomally 
made peptides in both natural and heterologous hosts. 
Like polyketide synthases, peptide synthases are modular 
and multifunctional enzymes catalyzing condensation 
reactions between activated building blocks (in this case 
amino acids) followed by modifications of those building ' 
blocks (see Kleinkauf, H. and von Dohren, H. Eur. J. 
Biochem. 236:335-351 (1995) ). Thus, as for polyketide 
synthases, recursive sequence recorr±)inat ion can also be 
used to alter peptide synthases: modifying the 
specificity of the ammo acid recognized by each binding 
sice on the enzy-e and altering the activity or substrate 
specificities of sites that nodify these amino acids to 
produce novel compounds with antibiotic activity. 

Other peptide antibiotics are made ribosoxally 
and then pos t - t ranslat lonally modified. Examples of this 
t>^e of antibiotics are lantibiotics (produced by gram 
positive bacteria such S caphylococcus , S zrepconyces , 
Bacillus, and Accinoplanes) and microcins (produced by 
Enterobacteriaceae) . Modifications of the original 
peptide include (in lantibiotics) dehydration of serine 
and threonine, condensation of dehydroamino acids with 
cysteine, or simple N- and C-terminal blocking 
(microcins) . For ribosomaily r.ade antibiotics both the 
peptide-encoding sequence and the m.odifying enzymes may 
have their expression levels modified by recursive 
sequence recom.bina t ion . Again, this will lead to both 
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increased levels of antibiotic synthesis, and by 
modulation of the levels of the modifying enzymes (and 
the sequence of the ribosomally synthesized peptide 
itself) novel antibiotics. 
5 Screening can be done as for other antibiotics 

as described above, including competition with a 
sensitive (or even initially insensitive) microbial 
species. Use of competing bacteria that have resistances 
to the antibiotic being produced will select strongly 
10 either for greatly elevated levels of that antibiotic (so 
that it swamps out the resistance mechanism) or for novel 
derivatives of that antibiotic that are not neutralized 
by the resistance mechanism. 

15" F. Polymers 

Several examples of metabolic engineering to 
produce biopolymers have beer, reported, including the 
production of the biodecrad£:bl e plastic 

polyhydroxybutarate (?H3), ar.i zhe polysaccharide xanthan 

20 cum. For a review, see Caxeron et al . Applied Biochen. 

Siotech . 38:105-140 (IS?:). Genes for these pathways 
have been cloned, making them excellent candidates for 
the recursive seo^uence re comjDinat ion techniques described 
above. Expression of such evolved aenes in a 

25 commercially viable hose such as £. coll is an especially 

attractive applicaticr. of this tec hno log^/ . 

Examples of starting materials for recursive 
sequence recomjDinat ion include but are not limited to 
genes from bacteria such as Alcalxgenes , Zoogloea, 

30 Rhizobiun}, Sacillus , and Azobaccer , which produce 

polyhyd roxya 1 ka noa t e 3 (?HAs) such as oolyhyroxybuty rate 
(?HB) int raceliularly as energy reserve m.aterials in 
response to stress. Genes frrm. Alcaligenes eucrophus 
that encode enzymes cataly::ir.g the conversion of 

35 . acetoacetyl CoA to ?H3 have been transferred both to E. 
coll and to the plant Arabidopsis chaliana 
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(Poirier et al. Science 256:520-523 (1992)). Two of these 
genes (phbB and phbC, encoding acetoacetyl -CoA reductase 
and PHB synthase respectively) allow production of PHB in 
Arabidopsis. The plants producing the plastic are 
stunted, probably because of adverse interactions between 
the new metabolic pathway and the plants' original 
metabolism (i.e., depletion of substrate from the 
mevalonate pathway) . Improved production of PHB in 
plants has been attempted by localization of the pathway 
enzymes to organelles such as plastids. Other strategies 
such as regulation of tissue specificity,- expression 
timing and cellular localization have been suggested to 
solve the delete rious erfects of PHB exoression in 
plants. The recursive sequence recom.binat ion techniques 
of the invention can be used to modify such heterologous 
genes as well as specific cloned interacting pathways 
(e.g., mevalonate), and to optimize PHB synthesis in 
industrial microbial strains, for example to rem.ove the 
requirement for stresses (such as nitrogen limitation) in 
growth conditions. 

Additionally, ether microbial polyesters are 
mace by different bacteria in which additional m.onomers 
are incorporated inco the polvTr.er {Peoples et al. in 
.Novel Biodegradable- Microbial PoUnners, EA Dawes, ed . , 
pplSl-202 (1990)). Recursive sequence recombination of 
these genes or pathwa\'5 singly or in combination into a 
heterologous host will allow the production of a variety 
cf polvTP.ers with differing properties, including 
variation of the m.onomer subunit ratios in the polyrr.er. 
Another pol^/rr.er whose synthesis may be manipulated by 
recursive sequence recombination is cellulose. The genes 
for cellulose biosynthesis have been cloned from 
Agrobacterium cumefaclens (Mat thysse , A.G. et. al. J . 
Sacteriol . 177:10 6 9-1075 {19?S)). Recursive sequence 
reconrJDination of this biosynthetic pathway could be used 
either to increase synthesis of cellulose, or to produce 
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mutants in which alternative sugars are incorporated into 
the polymer. 

G . Carotenoids 

Carotenoids are a family of over 600 terpenoids 
produced in the general isoprenoid biosynthetic pathway 
by bacteria, fungi and plants (for a review, see 
Armstrong, J. Bact . 176:4795-4802 (1994)). These 
pigments protect organisms against photooxidative damage 
as well as functioning as anti-tumor agents, free 
radical-scavenging ant i -oxidants , and enhancers of the 
immune response. Additionally, they are used 
commercially in pigmentation of cultured fish and 
shellfish. Examples of carotenoids include but are not 
limited to myxobacton, spheroicene, spheroidenone , 
lutein, astaxanthin, violaxanthin , 4 -ketorulene , 
nyxoxanthrophyll , echinencne, lycopene, zeaxanthin and 
its mono- and di- clucosices, a-, G~ , y- and <5-carotene, 
p-crypcoxanthin monoclucos ice and neoxanthin. 

Carotenoid sv^.thesis is catalyzed by relatively 
small numbers of clustered genes: 11 different genes 
within 12 kb of DN'A frox :\yxccoccus xanchus (Botella et 
al. Eur. J. Siochem. 2 3 3:238-^2-1 3 (1995)) and 3 genes 
wichin 9 kb of DN'A fror. PJ^.odobacter sphaeroices (Lang et . 
al. J. 3ac" ■ 177:2064-2073 (i5?5;). In some 
r.i croorgan i sms , such as Therrr.iis zherrr.ophi lus , these genes 
are plasmid-borne (Tabata et al . FES5 Letts 341:251-255 
(1954)). These features n^.ake carotenoid synthetic 
pathways especially attractive candidates for recursive 
sequence recombination. 

Transfer of soxe carctenoid genes into 
heterologous organisms results m expression. For 
example, genes from Erwina 'jrezovora and Haema tococcus 
pluvial is will function together in E. coii (Kajiwara et 
ai. Plant Mol . Siol . 29:343-352 (1995)). £. herbicola 
genes v;ill function in P.. sp'naeroides (Hunter e t al , J . 
5act . 176:3692-3697 (1994)). However, some other genes 
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do not; for example T-R. capsulatus genes do not direct 
carotenoid synthesis in E. coli (Marrs, J. Bact. 
146:1003-1012 (1981) ) . 

In an ernbodiment of the invention, the 
recursive sequence recombination techniques of the 
invention can be used to generate variants in the 
regulatory and/or structural elements of genes in the 
carotenoid synthesis pathway, allowing increased 
expression in heterologous ' hosts . Indeed, traditional 
techniques have been used to increase carotenoid 
production by increasing exoression of a ra^^ T;rT>,-H;„^ 
enzyme in Thermus thermophilus (Koshino et al . AdoI . 
Environ. Micro, 59:3150-3153 (1993)). Furthermore, 
mutation of regulatory genes can cause constitutive 
expression of carotenoid s^-nthesis in accinomycetes , 
where carotenoid photoinducibi 1 i ty is otherwise unstable 
and lost at a relatively high frequency in some species 
(Kato et al. Mol . Gen. Gene:. 247:387-390 (1995)). These 
are both mutations that can be obtained by recursive 
sequence recorrj^inat ion . 

The recursive seq^^e^ce recombination techniques 
or the invention as described above can be used to evolve 
one or r,ore carotenoid synthesis genes in a desired host 
without Che need for analysis of regulatory mechanisms. 
Since carotenoids are colored, a colorimetric assay in 
microliter places, cr even cn growth media plates, can be 
used for screening fcr increased production. 

In addition to increasing expression of 
carotenoids, carotenogenic bios>Tithet ic pathways have the 
potential to produce a wide diversity of carotenoids, as 
the enzyrr.es involved appear to be specific for the type 
of reaction they will catalyze, but not for the substrate 
that they modify. For example, two enzyrr.es from the 
marine bacterium Agrobac cerin" auran ciacurr) (Crtv; and 
CrtZ) synthesize six different >e tocarot enoids from 
/3-carotene (Misawa et al, J. Bact. 177:6576-6584 (1995)). 
This relaxed substrate specificity means that a diversity 
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of substrates can be transformed into an even greater 
diversity of products. Introduction of foreign 
carotenoid genes into a cell can lead to novel and . 
functional carotenoid-protein complexes, for example in 
photosynthetic complexes (Hunter et al. J.Bact . 176:3692 
3697 (1994)). Thus, the deliberate recombination of 
enzymes through the recursive sequence recombination 
techniques of the invention is likely to generate novel 
compounds. Screening for such compounds can be 
accomplished, for example, by the cell 

competition/survival techniques discussed above and by a 
colorimecric assay for pigmenced compounds. 

Ajiocher method of identifying new compounds is 
to use standard analytical techniques such as mass 
spectroscopy, nuclear magnetic resonance, high 
performance liquid chromatccraphy , etc. Recoxbinant 
microorganisms can be pooled and extracts or media 
supernatants assayed from these pools. Any positive poo] 
can then be subdivided and ihe procedure repeated until 
the single positive is identified (" s ib- select ion" ) . 

H . Indigo Biosynthesis 

nany dyes, i.e. agencs for imparting color, ar 
specialty chemicals witih significant markets. As an 
example, indigo is currently produced chemically. 
However, nine genes have bee.n corrjoi ned in E. coi i to 
allow the synthesis of indigo from glucose via the 
tryptophan/indole pathway (Murdock et al . Bio/Technoloov 
11:381-386 (1993)). A nurriDer of m.anipulat ions were 
performed to optimize indigo synthesis: cloning of nine 
genes, modification of the f erm.enta t ion medium and 
directed changes in two cperons to increase reaction 
rates and catalytic activities of several enzymes. 
Nevertheless, bacterially produced indigo is not 
clirrently an economic proposition. The recursive 
sequence recombination techniques of the instant 
invention could be used to optimize indigo synthesizing 
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enzyme expression levels and catalytic activities, 
leading to increased indigo production, thereby making 
the process commercially viable and reducing the 
environmental impact of indigo manufacture. Screening 
for increased indigo production can be done by 
colorimetric assays of cultures in microtiter plates. 

I . Amino Acids 

Amino acids of particular commercial importance 
include but are not United to phenylalanine, monosodium 
glutarr.ate, glycine, lysine, threonine, tryptophan and 
methionine. Backman et al . ( Ann. NY Ar^H q,-,- 589:16-24 
(1990)) disclosed the enhanced production of 
phenylalanine in E. coll via a systematic and downstream 
strategy covering organism selection, optimization of 
biosynchetic capacity, and development of fermentation 
and recovery processes. 

As described i.-. Si~pson et al . Qiochem f^nr 
Trans, 23:381-387 (::35);. current work in the field of 
20 amino acid production is focused on understanding the 

regulation of these pathways in great molecular detail. ' 
The recursive sequs.-.ce recombination techniques of t.he 
instant invention wo'_id obviate the need for this 

analysis to obtai." ba-t"^' ? i q---3i"cr i.-irv, • 

s^.aiws wit.n higner secreted 

25 ammo acid v^'^'ir^^ ^r-i--" „_„^.. ■ -, , , 

/ .......no aci^ proouction could be 

optimized for expressicr. using recursive sequence 
recombination cf the amino acid s-./nthesis and secretion 
genes as well as enzy;nes at the regulatory 
phosphoenolpyruvate branchpoint, from such organisms as 
Serratia marcescens, Bacillus, and the Coryn eba c c eri u.'. 
-Brevibacteriu.Tj group. in some embodiments of the 
invention, screening for enhanced production is 
preferably done in micro:. ter wells, using chemical tests 
well known in the art that are specific for the desired 
35 amino acid. Screening/selection for amino acid synthesis 

can also be done by using auxotrophic reporter cells that 
are themselves unable to synthesize the amino acid in 
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question. If these reporter cells also produce a 
compound that stimulates the growth of the amino acid 
producer (this could be a growth factor, or even a 
different amino acid) , then library cells, that produce 
5 more amino acid will in turn receive more growth 
stimulant and will therefore grow more rapidly. 

J . Vitamin C synthesis 

L-Ascorbic acid (vitamin C) is a commercially 

10 important vitamin with a world production of over 35,000 

tons in 1984. Most vitamin C is currently manufactured 
chemically by the Reichstein process, although recently 
bacteria have been engineered chat are able to transform 
glucose to 2 , 5 - keto-gluconic acid, and that product to 2- 

15 keto-L- idonic acid, the precursor to L-ascorbic acid 

(Boudrant, Enz-vrr.e Microb. Techr.ol . 12:322-329 (1990)). 

The efficiencies of these enzymatic steps in 
bacteria are currently low. Using the recursive sequence 
recombination techniques of the instant invention, the 

20 genes can be genetically engineered to create one or more 

ope r ens followed by expression optimization of such a 
hybrid L-ascorbic acid synthetic pathway to result in 
cor7iT.erc ial ly viable microbial vicamin C biosynthesis. In 
some enbodimenc s , screening for enhanced L-ascorbic acid 

25 production is preferably done in r.icrotiter plates, using 

assays well known in che art 

'^-'I ■ Modification of Cell Properties. 

Although not strictly examples of manipulation 

30 of intermediary metabolism, recursive sequence 

recomx)inat ion techniques can be used to improve or alter 
other aspects of cell properties, from growth rate to 
ability to secrete certain desired compounds to ability 
to tolerate increased t err.pera ture or other environmental 

35 Stresses. Som.e examples of traits engineered by 

traditional methods include expression of heterologous 
proteins in bacteria, yeast, and other eukaryotic cells, 
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antibiotic resistancfe, and phage resistance. Any of 
these traits is advantageously evolved by the recursive 
sequence recombination techniques of the instant 
invention. Examples include replacement of one nutrient 
5 uptake system (e.g. ammonia in Methylophilus 

wethylotrophus) with another that is more energy 
efficient; expression of haemoglobin to improve growth 
under conditions of limiting oxygen; redirection of toxic 
metabolic end products to less toxic compounds; 

10 expression of genes conferring tolerance to salt, drought 
and toxic compounds and resistance to pathogens, 
antibiotics and bacteriophage, reviewed in Cameron et . 
al . AddI Biochem Biotechnol , 38:105-140 (1993). 

The heterologous genes encoding these functions 

15 all have the potential for further optimization in their 
new hosts by existing recursive sequence recorrlDinat ion 
technology. Since these functions increase cell growch 
rates under the desired growth conditions, optimization 
of the genes by evolution si"ply involves recorruDining the 

20 DKA recursively and selecting the recombinants that grow 
faster with limiting oxygen, higher toxic compound 
concentration, or whatever is the appropriate growth 
condi::ion for the parameter b^eing improved. 

Since these functions increase cell growth 

25 rates under the desired growth conditions, optimization 
of zhe genes by "evolucion'' can simply involve 
"shuffling" the DN'A and selecting the recorJoinants that 
grow faster with limiting oxygen, higher toxic compound 
concentration or whatever restrictive condition is being 

3 0 ove rcome . 

Cultured mammalian cells also require essential 
amino acids to be present m the growth medium. This 
requirement could also be circumvented by expression of 
heterologous metabolic pathways that synthesize these 
35 .amino acids (Rees et al. Biotechnology 8:629-633 (1990). 
Recursive sequence recombination would provide a 
mechanism for optimizing the expression of these genes in 
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mammalian cells. Once again, a preferred selection would 
be for cells that can grow in the absence of added amino 
acids . 

Yet another candidate for improvement through 
the techniques of the invention is symbiotic nitrogen 
fixation. Genes involved in nodulation (nod, ndv) , 
nitrogen reduction (nif , fix) , host range determination 
' (nod, hsp) , bacceriocin production (tfx) , surface 
polysaccharide synthesis (exo) and energy utilization 
(d'ct, hup) which have been identified (Paau, Biotech. 
Adv. 9:173-184 (1991) ) . 

The main function of recursive sequence 
recombination in this case is in improving the survival 
of strains chat are already known to be better nitrogen 
fixers. These strains tend to be less good at competina 
with strains already present in the environment, even 
though they are better at nitrogen fixation. Targets for 
recursive sequence recor?jDinat ion such as nodulation and 
host range determination csnes can be modified and 
selected for by their abili::y to crow on the new host. 
Similarly any bactericcin or energy utilization genes 
that will iTiorove the co~pe t i t iveness of the strain will 
also result in greater growth rates. Selection can 
simply be performed by s'jbjectinc the target genes to 
recursive sequence re ccrrJoina t ion and forcing the 
inoculant to compete with wild tvpe nitrogen fixing 
bacteria. The better the nitrogen fixing bacteria grow 
in the new host, the more copies of their recombined 
genes will be present for the next round of 
recombination. This growth rate differentiating 
selection is described above in detail. 

VI . Biode tec tor 9 /Biosensors 

Bioluminescence or fluorescence genes can be 
used as reporters by fusing therr. to specific regulatory 
genes (Cameron et. al. Aool Biochem Biotechnol . 38:105- 
140 (1993)). A specific example is one in which the 
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lucif erase genes luxCDABE of Vibrio fischeri were fused 
to the regulatory region of the isopropylbenzene 
catabolism operon from Pseudomonas putida RE204. 
Transformation of this fusion construct into E. coli 
resulted in a strain which produced light in response to 
a variety of hydrophobic compound such as substituted 
benzenes, chlorinated solvents and naphthalene 
(Selifonova et. al . , AddI Environ Micrnhini 62:778-783 
(1996)). This type of construct is useful for the 
detection of pollutant levels, and has the added benefit 
of only measuring those pollutants that are bioavailable 
(and therefore potentially toxic) . Other signal 
molecules such as jellyfish green fluorescent protein 
could also be fused to ger.etic regulatory regions that 
15 respond to chemicals in the environment. This should 

allow a variety of molecules to be detected by their 
ability CO induce expressicr. of a protein or proteins 
which result in light, fluorescence or some other easily 
detected signal. 

Recursive sequence recombination can be used in 
several ways to modify this t^/pe of biodetection system. 
It can be used to increase -he amplitude of the response, 
for example by increasing zhe 'fluorescence of the green 
fluorescent protein. Recursive sequence recombination 
could also be used to increase induced expression levels 
or catalytic activities of other signal -generating 
systems, for exa~;ple of the luciferase genes. 

Recursive sequence recombination can also be 
used to alter the specificity of biosensors. The 
regulatory region, and transcriptional activators chat 
interact with this region and with the chemicals chat 
induce transcription can also be shuffled. This should 
generate regulatory systems m which transcription is 
activated by analogues of the normal inducer, so that 
biodetectors for different chemicals can be developed. 
In this case, selection would be for constructs that are 
activated by the (new) specific chemical to be detected. ' 
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Screening could be cfone simply with fluorescence (or 
light) activated cell sorting, since the desired 
improvement is in light production. 

In addition to detection of environmental 
pollutants, biosensors can be developed that will respond 
to any chemical for which there are receptors, or for 
which receptors can be evolved by recursive sequence 
recombination, such as hormones, growth factors, metals 
and drugs. These receptors may be intracellular and 
direct activators of transcription, or they may be 
membrane bound receptors that activate transcription of 
the signal indirectly, for example by a phosphorylation 
cascade. They may also not act on transcription at all, 
but may produce a signal by some pose - transcript ional 
modification of a component of the signal generating 
pathway. These receptors r.ay also be generated by fusing 
domains responsible for binding different ligands with 
different signaling domains. Again, recursive sequence 
recombination can be used ::o increase the amplitude of 
the signal generated to cpcir.ize expression and 
functioning of ch irp.eric receptors, and to alter the 
specificity of the chemicals detected by the receptor. 

The foliowir.g exarr.ples are offered by way of 
illustration, not by way of limitation. 

EXAMPLES 

I . Alteration of enzvme activity and specificity. 

In this example, recursive sequence 
recoujoina t ion techniques of the instant invention were 
used to expand the range of substrates efficiently 
hydrolyzed by E. coli /3-caiacto3idase . The goal was to 
evolve wild type E. coli f -calaccosidase into a 
fucosidase. The enzN^e shrv.-ed very weak activity with 
both p-ni trophenyl - /3-D - fucopyrano side and 

'o-ni trophenyl-/5-D - f ucopyra:vD5ide (estimated respectively 
as 80- and 160-fold less efficient than for 
p -ni trophenyl - /3 - D-galactopyranoside ) . 
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To increase the activity of E. coli 
/3-galactosidase against these f licopyranoside derivatives, 
a lacZ gene (a 3.8 kb Hind III -BamHI fragment from 
plasr-.id pCHllO, Pharmacia) encoding E. coli 
^-galactosidase was subcloned into plasmid pl8SFI-BLA-SFI 
(Stemmer, Nature , 370:389-391 (1994)). The resulting 
plasmid, pl8-lacZ, was used for recursive sequence 
recombination and mutant screening. 

Purified plasmid pl8-lac2 (4-5 (ig) was used 
directly for DNase I fragmentation. Fragments with sizes 
between 50 and 200 bp were purified from a 2V agarose gel 
and used for reasserrily PCR (Stemmer, Nature 370:389-391 
(1994)). Assembly reactions used Tth polymerase (Perlcin 
Elmer) in the manufacturer's supplied buffer. The PCR 
program for asseTily was as follows: 94 °C, 2 min., then 
40 cycles of 94°C for 30 sec.; 55«C for 3 sec.; 72'C for 
1 min. 4 5 sec. per cycle; the.-, finally 72°C for 5 min. 

This reactic.-. was diluted 100-fold into a 
standard PCR reactio.-. usi.-.g the 4 0mer primers p50F 
2 0 5 ' - AGCGCCCA.^T.^CGCA.iJ-.CCGCCTCTCCCCGCGCGTTGGCC - 3 ' ' 

and pR34 5 ' - CTATGCGGC.-.TC.\G.^GC.nG.nTTGTACTGAGAGTGCACCAT- 3 ' . 
This resulted in amplif icatio.n of both the desired DN^A 
band (about 4 kb -n size) as well as ' two smaller sized 
products (about 600bp and lOObp ba.nds ) . The PCR products 
25 were digested with Bar^l and Kind III and the correct 
size product was clc.-ed into 3a~:-il -Hindi 1 1 digested 
?1S-Iacz. The resultir.g plasmid containing a pool of 
recombined lacZ mutants was plated out on LS plates 
supplemented with kanar.ycin and 

5-bromo-4-chloro-3-indclyl-/5-D-fucopyranoside (X-fuco) . 
Plates were incubated at 37°C for 20 hours and screened 
for .colonies with slight blue tint, indicating hydrolysis 
of the .X-fuco. Plasmid d;;.:. was prepared from positive 
colonies and the procedure was repeated. Thus, six 
rounds of recursive sequence recombination produced a 
ten-fold increase in X-fuco hydrolysis activity. 
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^ I . Evolution of an Entire Metaibolic Pathway 

As an example of evolution of an entire 
metabolic pathway, the recursive sequence recombination 
techniques of the invention were used to modify a plasmid 
encoding resistance to mercury salts. This plasmid, as 
disclosed by Wang et al. ( J. Bact . 171:83-92 (1989)) 
contains at least 8 genes within 13.5 kb of Bacillus DNA 
inserted in the cloning vector pUC9 . The recursive 
sequence recombination protocol used for this plasmid was 
as follows. 

Plasmid DNA {at 130 ^g/ml) was digested with 
0.09 U/ml DNAse in 50 rr-M Tris-Cl, pH 7.4, 10 mM MnClj, for 
10 minutes at 25®C. DNA fragments were not size- 
selected, but were purified by phenol extraction and 
ethanol precipi t.at ion . The assembly reaction was 
performed using Tth polymerase (Perkin Elmer) using the 
manufacturer's supplied buffer, supplemented with the 
following: 7.5V polyethylene glycol, 8000 NfW; 3 5 m>l 
tet ramethylammonium chloride; and 4 U/ml Pwo (Eoehringer 
Mannheim), ?f u (Stratagene) , Ver.: (New England Biolabs) , 
Deep Vent {New England Bioiabs) , Tf 1 (Promega) or Tli 
(Promega) thermostable DNA polv—erases . DNA fragments 
were used at arcur.d 10 ,t:g/ml . 

The PCR program for asseTbly was as follows: 
S^'i^C for 2 0 sec., then 40 cycles of 94°C for 15 sec., 
40 = C for 30 sec., for 30 sec. * 2 sec. /cycle, and 

finally 72^C for 10 min . 

The recombinant plasmid was then am.plified in 
three fragments by using primers flanking the three 
relatively evenly spaced AlwNI restriction sites 
contained in the plasmid. The sequences of these primers 
were : 

1) 5 ' - CAGGACTTATCGCCACTGGCAGC - 3 ' 

2 ) 5 ' - CTGGCTCTGCTAATCCTGTTACC - 3 ' 

3 ) 5 ' - GCAT.ATTATGAGCGTTTAGGGTTAATTCC- 3 ' 
4 ) 5 ' - GGGTATCGTTTTTCCGT ACGTTC - 3 ' 

5 ) 5 ' - GTTGA-AG.AGGTG AAGAAAGTTCTCC - 3 ' 
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6 ) 5 ' - GTTCGTCGATTTCCACGCTTGGC - 3 ' . 

Three fragments were amplified using primers 
1+4 (6 kb fragment), 2+5 {4 kh fragment) and 3+6 (6 kb 
fragment) . These were then digested with AlwNI, gel 
purified and ligated together. As AlwNI is a 
non-palindromic cutter, the plasmid could only reassemble 
in the correct (original) order. The resultant plasmids 
were transformed into E. coll strain DHIOB (Gibco BRL) 
and selected on nutrient agar containing ampicillin 
50^g/ml and increasing concentrations of mercuric 
chloride (ioOmM to IOOOmM) or phenylmercuric acetate 
(50mM to 400/iM) Thus, in 2 rounds of recursive sequence 
recombination the tolerance of E. coli to these compounds 
increased by a factor of 10 (from about 100 to about 
1 , 000 ^M) . 

Recursive secaen ce Recombination of a Family nf 
Related Enzvrp.ef; 

In this exa-.ple nucleotide sequences were 
recombined between four hc~Dlogou3 /3- lactamases from C. 
freundii, E. cloacae. :■: . pnenr.onia , and Y. 
enterocolicica. The four g'^nes were synthesized from 
oligonucleotides as described in Stemmer, et al. Gene 
164:49-53 (1995). Briefly, the entire coding sequences 
of the genes were sy-nthesized as overlapping 50-ner 
oligonucleotides on a co-mercial oligonucleotide 
synthesizer. The oligo.nuclec: ides were then asserPi^led 
into full length genes by a standard recursive sequence 
recoxu3inaticn reaction, followed by anpl i f icat ion using 
primers common to all four genes. Oligonucleotides were 
designed to give optimal E. coli codon usage in the 
synthetic genes with the goal of increasing the homology 
to increase the freq^uency cf recombination, and the same 
.5' and 3' terminal sequences. After assembly of the 
genes and selection for active clones, which is optional, 
they were DNase treated to produce fragments from 50 to 
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200 bp in length. The fragments were dissolved at 100 
Aig/ml in 15 ^1 of Klenow (DNA polymerase I large 
fragment) buffer (New England Biolabs) and subjected to 
manual PCR as follows: 15 cycles of 95°C for 1 min.; 
.freeze on dry ice and ethanol; warm to 25°C and add 2 ^1 
of Klenow (1 U/^1) in Klenow buffer; incubate for 2 min 
at 25*=^C. 

A 5 ;xl aliquot of the manual PCR reaction was 
then diluted 6 -fold into a standard Taq reaction mix 
(without oligonucleotide primers) and assembled using a 
standard PCR program consisting of 30 cycles of 94 °C for 
30 sec, 40°C for 30 sec, and 72°C for 30 sec 

A 4, 8 or 16 /il aliquot of this second PCR 
reaction was then diluted into a standard Taq reaction 
nix containing oligonucleotide priners that prime on 
sequences contained in all four /3-lactanase genes 
5' -AGGGCCTCGTGATACGCCTATT-3 ' and 

5' -ACGAAAACTCACGTTAAGGGATT-3 ' . Full-length product was 
amplified using a standard PCR program consisting of 25 
cycles of 94°C for 30 sec. nS^C for 30 sec, 72°C for 45 
sec . 

This procedure produced hybrid ^3- lactamase 
genes whose activities can be tested against antibiotics 
including but not lirr.ited to arr.pici 1 1 in , carbenic i 1 1 in , 
cefotaxime, ce f oxi *: ine , cloxaci 11 in , ceftazidime; 
cepha lor idine and r.oxaiactam, to determine the 
speci f ici c ies of the hybrid enzymes so created, 
r.oxalactam was chosen as the test antibiotic for hybrid 
genes. The best of the original /3- lactamase genes used 
in this study conferred resistance Co 0.125 pg/ml of 
moxalactam. After the first round of recursive sequence 
recombination hybrid genes were isolated that conferred 
resistance to 0.5p:g/ml moxalactam, yielding a 4-folG 
increase . 
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IV. Shuffling to Generate Improved Arsenate 

Detoxification Bacteria 
Arsenic detoxification is important for 
goldmining of arsenopyrite containing gold ores and other 
uses, such as environmental remediation. Plasmid pGJ103 , 
containing an operon encoding arsenate detoxification 
operon (Wang et al . (1989) Bacteriol . 171: 83, 
incorporated herein by reference) , was obtained from 
Prof. Simon Silver (U. of Illinois, Chicago, XL), 
E. coli TGI containing pJG103, containing the pl258 ars 
operon cloned into puC15, had a HIC (minimum inhibitory 
concentration) of 4 ^g/ml on LB amp plates. The whole 
5.5 Jcb plasmid was fragmented with DNAse I into fragments 
of 100-1000 bp, and reassembled by PGR using the Perkin 
Elmer XL-PCR reagents. After assembling, the plasmid was 
digested with the unique restriction enzyme BamHI . The 
full length monomer was purified from the agarose gel, 
iigated and electroporate:: into £. coli TGI cells. The 
tranformed cells were plates on a range of sodium 
arsenate concentrations {2, A, B, 16 ttuM in round 1), and 
approx. 1000 colonies fror. the plates with the highest 
arsenate levels were pooled by scraping the plates. The 
cells were grown: in liquid in the presence of the same 
concentration of arsenate, and plasmid was prepared from 
tnis culture. Rounz 2 and 3 were identical to round 1, 
except that the cells were plated at higher arsenate 
levels. S, 16, 22, 5-; r.M were used for round 2; and 32, 
128, 256 m-M were used for selection of round 3. 

The best mutants crew overnight at up to 128 
arsenate (MIC=255), a 64-folc improvement. One of the 
improved strains showed that the TGI (wildtype pGGJ103) 
grew in liquid at up to :: rr:-:, whereas the shuffled 
TGI (mutant pGJ103) crew ^i: -jp to 150 rr^X arsenate 
concentration . 

PGR program fcr the assembly was 94 °C 20s, 
50x(94°G 15s, 5G^G 1 min, 72°C 30s+2s/cycle) , using a 
circular PC'R format without orimers. 
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Four cycles of the process resulted in a 50- 
100-fold improvement in the resistance to arsenate 
conferred by the shuffled arsenate resistance operon; 
bacteria containing the improved operon grew on medium 
containing up to 500mM arsenate. 

Although the foregoing invention has been 
described in some detail by way of illustration and 
example for purposes of clarity of understanding, it will 
be obvious that certain changes and modifications may be 
practiced within the scope of the appended claims. 

All references cited herein are expressly 
incorporated in their entirety for all purposes. 



wo 97/35966 



PCT/US97/04715 



75 



WHAT 15 CLAIMED T.S : 

1. A method of evolving a biocatalytic 
activity of a cell, comprising: 

(a) recombining at least a first and second 
DMA segment from at least one gene conferring ability to 

_ catalyze a reaction of interest, the segments differing 
from each other in at least two nucleotides, to produce a 
library of recombinant genes; 

(b) screening at least one recombinant gene 
from the library that confers enhanced ability to 
catalyze the reaction of interest by the cell relative to 
a wildtype form of the gene; 

(c) recombining a- least a segment from the at 
least one recombinant gene wich a further DNA segment 
from the at least one gene, the same or different from 
the first and second segments, to produce a further 
library of recombinant genes; 

(d) screening at least one further recombinant 
gene from the further library of recombinant genes that 
confers enhanced ability to catalyze the reaction of 
interest by the cell relative to a previous recom.binant 
gene ; 

(e) repeating (c) and {c) , as necessary, until 
the further rezoTj^inant gene confers a desired level of 
enhanced ability to catalyze the reaction of interest bv 
the cell. 

2. The m.ethoG of claim 1, wherein the 
reaction of interest is the abilitv to utilize a 
substrate as a nutrient source 

3. The methcc of claim 1, wherein the 
reaction of interest is the ability to catabolize a 
comoound . 
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1 , 4. The method of claim 1, wherein the 

2 reaction of interest is the ability to detoxify a 

3 compound . 

1 5. The method of claim 1, wherein the 

2 reaction of interest is the ability to synthesize a 

3 compound of interest. 

1 6. The method of claim 5, wherein the 

2 compound is an antibiotic. 

1 7. The method of claim 5, wherein the 

2 compound is an amino acid. 

1 .8. The method of claim 5, wherein the 

2 compound is a polymer . 

1 9, The method of claim 5, wherein the 

2 compound is a carotenoic . 

1 10. The method of claim 5, wherein the 

2 compound is vitamin C, 

1 11 . The method of claim 5, wherein the 

2 compound is indigo. 

1 12. The method of claim 1, wherein at least 

2 one recorrjoining step is performed in vitro, and the 

3 resulting library of recombinants is introduced into the 
cell whose biocatalytic activity is to be enhanced 

5 generating a library of cells containing different 

6 recombinants. 

1 13. The method of claim 12, wherein the in 

2 vitro recomLbininc step com.prises: 

3 cleaving the first and second segments into 

4 fragments; 
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5 mixing and denaturing the fragments; and 

6 incubating the denatured fragments with a 

7 polymerase under conditions which result in annealing of 

8 the denatured fragments and formation of the library of 

9 recombinant genes. 

1 14. The method of claim 1, wherein at least 

2 one recombining step is performed in vivo. 

1 . 15. The method of claim 1, wherein the 

2 recombining step is performed in the cell whose 

3 biocatalytic activity is to be enhanced. 

1 16. The method of claim 1, wherein at lease 

2 one DiMA segment comprises a cluster of genes collectively 

3 conferring ability to catalyze a reaction of interest. 

1 17. A rr.echod c: evolving a gene to confer 

2 abilicy to cacalyze a reacrio" on interest, the method 

3 comprising : 

4 (1) recorJ^'i.-i.-g a" least first and second DNA v 

5 segments from a: lease c.-.e gene conferring ability to 

S catalyze a reaction of interest, the segments differing 

7 from each other in ae least two nucleotides, to produce a 

3 library of re ccrr,b inane genes; 

9 (2) screening ac least one re corrJo inane gene 

10 f rom the library e h a e confers enhanced ability to 

11 catalyze a reaction of ineerese relative eo a wildtype 

12 form of the gene ; 

1^ (3) recombining ae lease a segment from the at 

14 least one recombinant gene wieh a further DNA segment 

15 from the at least one gene, the same or different from 

16 the first and second secmenes, eo produce a further 

17 library of recombinant genes; 

18 , (4) screening ae lease one further recombinant 

19 gene from the further library of recombinant genes that 
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20 - confers enhanced ability to catalyze a reaction of 

21 interest relative to a previous recombinant gene; 

22 (5) repeating (3) and (4), as necessary, until 

23 the further recombinant gene confers a desired level of 

24 enhanced ability to catalyze a reaction of interest. 

1 18. A method of generating a new biocatalytic 

2 activity in a cell, comprising: 

3 (1) recombining at least first and second DNA 

4 segments from at least one gene conferring ability to 

5 catalyze a first reaction related to a second reaction of 

6 interest, the segments differing from each other in at 

7 least two nucleotides, to produce a library of 

8 recombinant genes; 

9 (2) screening at least one recombinant gene 

10 from the library that confers a new ability to catalyze 

11 the second reaction of interest; 

12 (3) recoTJDinin^ at least a segm.ent from at 

13 least one reconr±)inant gene wich a further DNA segment 

14 from the at least one gene, the same or different from 

15 the first and second segments, to produce a further 

16 library of recorrJDinant genes; 

(4) screening at least one further recombinant 

18 gene from the further library of recombinant genes that 

15 confers enhanced ability zq catalyze the second reaction 

20 of interest in the cell relative to a previous 

21 recombinant gene; 

22 (5) repeating (3) and (4), as necessary, until 

23 the further recombinant gene confers a desired level of 

24 enhanced ability to catalyze the second reaction of 

25 interest in the cell, 

1 19. A modified fcrrr. of a cell, wherein the 

2 m.odif icatio?! comprises a rerabolic pathway evolved by 

3 recursive sequence zecor:joL:~.az ion . 
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20. A method of optimizing expression of a 
2 gene product, the method comprising: 

(1) recombining at least first and second DNA 

4 segments from at least one gene conferring ability to 

5 produce the gene product, the segments differing from 
S each other in at least two nucleotides, to produce a 
7 library of recombinant genes; 

(2) screening at least one recombinant gene 
from the library that confers optimized expression of the 
gene product relative to a wildtype form of the gene; 

•^■^ f^) recombining at least a segment from the at 

12 least one recombinant gene with a further DNA segment 

13 from the at least one gene, the same or different from 

14 the first and second segments, to produce a further 

15 library of recombinant genes; 

(4) screening at least one further recombinant 
gene from the further library of recomi^inant genes that, 
confers optimized ability zo produce the gene product 
relative co a previc:- reccT.binant gene; 

(5) repeating {3) and (4), as necessary, until 



8 
9 
10 



17 
18 
19 
20 



21 th^ 



22 



1 



rurtner recor?±>inant cene confers a des i red " leve 1 of 



optimized ability to express the gene product. 

21. The r.ethcd of claim 20, wherein the at 
least one gene encodes tne cer.e product. 

22. The method of claim 20, wherein the at 
least one gene is a vector comprising a gene encoding the 
gene oroduci . 



23. The method of claim 20, wherein at least 

in vi vo . 



2 one recombining step is performed 



24. The method of claim 23, wherein the 
recombining step is perforr.eii m a host cell wherein th^ 



^ gene product is exoressed 
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1 - 25. The method of claim 20, wheriin the at 

2 least one gene is a host cell gene and wherein the host 

3 cell gene does not encode the gene produce. 

1 26. The method of claim 20, wherein 

2 optimization results in increased expression of the gene 

3 product. 

1 27. A method of evolving a biosensor for a 

2 compound A of interest, the method comprising: 

3 (1) recombining at least first and second DNA 

4 segments from at least one gene conferring ability to 

5 detect a related compound 3, the segments differing from 

6 each other in at least two nucleotides, to produce a 
•7 library of recombinant genes; 

8 (2) screening at least one recorrJoinant gene 

9 from the library that confers optimized ability to detect 

10 compound A relative to a wildt^-pe form of the gene; 

11 (3) recor^Dininc at least a segment from the at 

12 least one recorr±)inant gene with a further DMA segment 

13 from the at least one gene, the same or different from 

14 the first and second segments, to produce a further 

15 library of reccTjDinant genes; 

16 (4) screening at least one further recor-ibinant 

17 gene from the further library of recomJoinann genes that 

18 confers optimized ability to detect compound A relative 

19 to a previous recombinant: gene; 

20 (5) repeating (3) and (4), as necessary, until 

21 the further recombinant gene confers a desired level of 

22 optimized ability to detect compound A. 

1 28. The method of claim 27, wherein 

2 optimization results in increased amplitude of response 

3 by the biosensor. 

1 29. The method of claim 27, wherein compound A 

2 and comoound 3 are different. 
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1 30. The method of claim 27, wherein compound A 

2 and compound B are identical. 
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